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failures. The second phase of the distributed precomputation 
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demands in order to free up capacity requiredjor_anoth^^ 
demand, so as to gptiraiz^j^veralllnetwofk^c^acity uti^^ 
^tipn^O " 
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—376 



SEND PROBE MESSAGES ON SELECTED (ROUTE. 
WAVELENGTH) PAIRS, AND OBTAIN FC^. X) 
378 — I VALUES FOR ALL LINKS ON THE PRIMARY AND 
FC{i, \i) VALUES FOR ALL LINKS ON THE 
RESTORATION PATHS. UPDATE G VALUES 



380 — 



CALCULATE J^^? FC(i, X) = fc(r, X) AND 
5^^g FC(i, ^) = fc(s, n) FOR ALL 
SELECTED ROUTE PAIRS. 
LET fc(r, X, s. \i) - min {fc(r, X), fc(s, ^)} 

LETfcsel= (r.xS. ^)^^'''^'^'^) 
SELECTED 



FIG. 17A 
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DISTRTOUTED PRECOMPUTATION OF 
NETWORK SIGNAL PATHS WITH TABLE- 
BASED LINK CAPACITY CONTROL 

This application is a contiDuatioo of application Ser. No, 
08/960,569, filed Oct. 29, 1997, now U.S. Pat. No. 6,021, 
113. 

RELATED APPLICATIONS 

The present application is related lo U.S. Patent Applica- 
tions entitled "Distributed Precomputation of Signal Paths in 
an Optical Network," "Distributed Precomputation of Net- 
work Signal Paths with Improved Performance Through 
Parallelization," "Multi-Phase Process for Distributed Pre- 
computation of Network Signal Paths," and "Hybrid 
Centralized/Distributed Precomputation of Network Signal 
Paths," each filed concurrently herewith in the names of 
inventors Bharat T. Doshi, Subrahmanyam Dravida, P. Har- 
shavardhana and Yufei Wang, and assigned to the assignee 
of the present application. 

FIELD OF THE INVENTION 

The present invention relates generally to techniques for 
restoring communication in a network after a failure in a 
link, span or node of the network, and more particularly to 
restoration techniques in which restoration paths are pre- 
computed at nodes distributed throughout the network. 

BACKGROUND OF THE INVENTION 

All-optical networks using wavelength division multi- 
plexing (WDM) are increasingly being deployed for a wide 
variety of communication applications. WDM techniques 
allow optical signals having different wavelengths to be 
multiplexed into a single optical fiber. Current WDM 
deployments allow multiplexing of up to about 16 different 
wavelengths on a single fiber, but systems multiplexing 32 
or more different wavelengths on a single fiber are expected 
to become available soon. Each of the wavelengths serves as 
an optical carrier and can be used independently of the other 
wavelengths, such that different wavelengths may use dif- 
ferent modulation formats to carry different signal types. In 
a simple example, each wavelength may carry a modulation 
signal representing a synchronous optical network/ 
synchronous digital hierarchy (SONET/SDH) client 
payload, where each client is a SONET-rate TDM applica- 
tion and the common carried signals are in an OC-48 or an 
OC-192 format. 

FIG. 1 shows a conventional optical routing device 10 
which includes a wavelength selecting cross-connect 
(WSCC) 12, two input optical fibers 14-1, 14-2 and two 
output optical fibers 14-3, 14-4. The routing device 10 in this 
embodiment is configured to route incoming optical signals 
at wavelengths and ^2 fiber 14-1 to output fibers 14-4 
and 14-3, respectively, and to route incoming optical signals 
at wavelengths \^ and X^' on fiber 14-2 to output fibers 14-4 
and 14-3, respectively. The WSCC 12 thus serves to cross- 
connect incoming wavelengths on a given input fiber to 
different output fibers, but does not provide any transfor- 
mation in wavelength. When only this type of routing device 
is present in an optical network, the network typically routes 
a given end-to-end demand using a single wavelength. If a 
primary network path assigned to the given demand fails, the 
demand generally must be carried on a secondary or resto- 
ration path using exactly the same wavelength as the pri- 
mary path. 

FIG. 2 illustrates an optical network 20 in which wave- 
length transformations may be provided for signals travers- 
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ing the network, but only at the interface between a client 
and the optical network. A first client equipment (CE) device 
18-1 communicates with a second CE device 18-2. The first 
CE device 18-1 uses wavelength and the second CE 

5 device uses wavelength The first CE 18-1 transmits a 
signal at to a wavelength adapter 22 which maps the 
incoming wavelength to an outgoing wavelength Xj. A 
wavelength adapter (WA) is a device which allows conver- 
sion of wavelength at the client- network interface. The 
wavelength Xj is used to cany the modulation signal of CE 
18-1 from an access node 24 of network 20 lo an egress node 
26 of network 20. The egress node 26 delivers the X2 signal 
to a second WA 28 which maps the wavelength X^ to 
wavelength X3 for transmission to the second CE 18-2. In the 

J 5 event of a failure in the primary path through optical 
network 20 from CE 18-1 to CE 18-2, a secondary or 
restoration path with a different wavelength, such as X4, may 
be used to transport the customer demand through the 
network 20. Other types of optical network elements com- 

20 bine features of the WSCC 12 of FIG. 1 and the WAs 22, 28 
of FIG. 2. For example, a wavelength interchange device 
may be used to cross-connect incoming wavelengths onto 
different output fibers while also providing transformation of 
wavelengths. Such devices are called wavelength intcr- 

25 changing cross-connects (WlCCs). 

An important issue in the design of large-scale optical 
networks including WSCCs, WAs, WICCs and other optical 
signal routing devices relates to traflBc restoration in the 
event of a failure in a link, span or node. A simplistic 

30 approach to restoration in an optical network is to provide 
complete redundancy, such that the network includes a 
dedicated back-up or secondary connection for each primary 
connection of the network. When a link, span or node of the 
primary connection fails, traflBc may then be switched onto 

35 the corresponding elements of the secondary connection. 
Unfortunately, this approach uses a large amount of resto- 
ration capacity and therefore may be undesirable in many 
networks. More sophisticated approaches involve the use of 
a path restoration algorithm to provide automatic restoration 

40 of network trafiSc in the event of a primary path failure, 
while sharing restoration capacities whenever possible. 

It should be noted that large-scale optical networks typi- 
cally include a large number of spans, and two different 
point-to-point links may share a common span section. FIG. 

45 3 illustrates a shared span section in a portion of a network 
including nodes A, B and C. The dotted lines AC and AB 
represent two distinct optical links. The physical layout, 
shown by solid lines, is such that both of these links share 
the span AS. If this span fails due to a fiber cut or other 

50 problem, then both the links AC and AB will fail Thus a 
demand using link AB on its primary path cannot be restored 
on a route using link AC. It is therefore important that a 
given restoration algorithm achieve restoration of network 
trafSc in the event of span failures as well as link failures, by 

55 providing distinct spans and links for the restoration path. 
Furthermore, to decrease vulnerability of the network to 
node failures, it is also desirable to perform automatic 
restoration in the event of single node failures. Thus the 
overall goal of an effective restoration algorithm should be 

60 to perform automatic restoration in the event of single link, 
span or node failures. The term "automatic" connotes res- 
toration by control computers in the network, rather than by 
manual intervention, thus permitting fast restoration. 
FIG. 4 shows a portion of a network including nodes A, 

65 B, C and D providing a bidirectional path at a wavelength X^ 
between a first CE 18-1 and a second CE 18-2. In simple 
optical networks, failures are generally discovered through 
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signal strength measurements, which may be collected for 
each individual wavelength at each node of the network. If 
a link failure occurs between nodes B and C as shown in 
FIG. 4, the bidirectional nature of the path allows each of the 
nodes A, B, C and D to detect a loss of signal (LOS) 5 
condition, but, with only the LOS information, none of these 
nodes will know the exact location of the failure. As a result, 
local restoration around the failed link, by the nodes con- 
necting the failed link, is generally not possible, assuming 
that the optical network under consideration does not 
employ any other mechanism to isolate failures. This inabil- 
ity to determine the exact location of the failure from LOS 
information also requires that the restoration path be disjoint 
from the primary path. Depending on whether a network 
includes WSCCs, WAs or WICCs, additional restrictions 35 
may be imposed on the restoration and primary paths of a 
demand. The network path of FIG. 4 includes only WSCCs, 
and the secondary or restoration path therefore must have 
the same wavelength as the primary path. As previously 
noted, more complex networks such as the network of FIG. 20 
2 also include WAs, such that the restoration path could have 
a different wavelength than the primary path, although the 
same wavelength generally must be used from an access 
node such as node 24 of FIG. 2 to an egress node such as 
node 26. FIG. 5 shows a more general situation in which 25 
nodes A, B, C and D between source CE 18-1 and destina- 
tion CE 18-2 each include WICCs. ^Fhe FIG. 5 network thus 
permits local wavelength transformations at each node, such 
that the path from the access node A to the egress node D 
need not be at a single wavelength. 30 

Path restoration techniques may be classified in many 
dimensions. A first classification is based on where the paths 
are computed. In that sense, the restoration computation may 
be either centralized or distributed. In the former, the res- 
toration path computation is done at a central controller 35 
which has global information regarding the network. In 
distributed restoration, each node computes the restoration 
paths for demands passing through that node. Another 
classification of path restoration techniques depends on 
where the restoration action is implemented. In particular, 40 
the restoration may be local, end-to-end path based or 
hybrid. In local restoration, the nodes closest to the point of 
failure initiate restoration action for all demands affected by 
a given failure. In end-to-end path based restoration, the 
source -destination node pairs of demands affected by a 45 
given failure initiate the restoration action. Hybrid restora- 
tion approaches have aspects of both local and end-to-end 
path-based restoration in that they seek to find the best 
restoration path, in terms of minimizing the required spare 
capacity, that is closest to the point of failure. 50 

Restoration techniques may also be differentiated by the 
time at which the restoration paths are computed. 
Discovery-based approaches determine restoration paths 
after a failure event has occurred, while precomputed 
approaches determine restoration paths before the failure 55 
event and the failure event merely triggers the activation of 
the precomputed paths. Discovery-based approaches may be 
centralized or distributed, but their defining characteristic is 
that they compute restoration paths in real time, after the 
failure occurs. Centralized discovery-based approaches use 60 
some mechanism (e.g., alarms) such that the network ele- 
ments detecting failures can communicate to the central 
controller, which then computes the best available paths. In 
the distributed discovery-based approach, as soon as a 
failure event occurs, the nodes affected by the failure need 65 
to find out where spare capacity is available, and to create 
restoration paths by reserving available spare capacity on 



selected paths. If two requests contend for the same spare 
capacity, then some form of contention-resolution procedure 
is needed to resolve the contention. The capacity search 
procedure, including contention resolution, is performed 
after the failure but before the demands affected by a failure 
can be rerouted. As a result, for distributed discovery-based 
restoration, the restoration times tend to be large and/or the 
spare capacity utilization is poor. Moreover, many of the 
constraints imposed by optical networks do not allow imple- 
mentation of simple distributed discovery-based approaches 
to restoration. 

A prior art centralized precomputation technique is 
described in J. Anderson, B. T, Doshi, S. Dravida and P. 
Harshavardhana, "Fast Restoration of ATM Networks," 
JSAC 1991, which is incorporated by reference herein. In 
centralized precomputation, a central controller in the net- 
work stores information on the entire network topology as 
well as capacities of all links in the network. This controller 
runs an optimization algorithm with the objective of com- 
puting alternate paths for every possible failure in the 
network while utilizing minimum redundant capacity, and 
routing tables specifying these alternate paths are down- 
loaded to the appropriate network elements. When a failure 
is detected by a network element, it activates the correspond- 
ing alternate routing table. Similar action is taken by all the 
network elements as and when the elements receive the 
failure information. 

A drawback of this approach is that it requires a central 
controller of substantial computing capacity and may there- 
fore be hard to implement as the network increases in size. 

A prior art distributed discovery-based computation tech- 
nique is described in W. D. Grover, "The Self-Healing 
Network: A Fast Distributed Restoration Technique for 
Networks Using Digital Cross Connect Machines," IEEE 
Globecom 1987, and U.S. Pat. No. 4,956,835, issued to W. 
D. Grover on Sep. 11, 1990, both of which are incorporated 
by reference herein. In this approach, when a link failure is 
discovered, the nodes at the two ends of the failed link 
initiate a search for spare capacity in the network on links 
that are potential candidates for alternate routing. The avail- 
able spare capacity is then allocated on a first-come-first- 
served basis by one of the nodes to which the failed link is 
attached. This approach suffers from a number of significant 
drawbacks, especially under the constraints likely to be 
present in typical conventional optical networks. First, the 
discovery of spare capacity after failure introduces at least a 
round-trip delay between nodes, thereby increasing restora- 
tion time. Second, even if sufiScient spare capacity is avail- 
able to restore all trafi&c affected- by the failure, the fact that 
capacity is allocated on a first -come-first -served basis may 
not allow full restoration in practice. Third, these and most 
other distributed discovery-based techniques are fundamen- 
tally intended for restoration of single link failures in 
networks in which the location of the failure can be identi- 
fied. This is because the node at one end of the failed link 
initiates capacity discovery. Since typical conventional opti- 
cal networks do not have failure isolation capability, the 
distributed discovery-based approach will not work for such 
networks. Finally, the distributed discovery-based approach 
exemplified in the above-cited Grover references does not 
work well for node failures. In the case of a node failure, the 
burden of discovering spare capacity falls on multiple nodes, 
not just the nodes on the two ends of the failed link. The 
Grover approach generally cannot be used by multiple nodes 
simuhaneously. 

Variants of the Grover approach are described in C. H. 
Yang et al., "FITNESS: Failure Immunization Technology 
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for Network Service Survivability," IEEE Globecom 1988, 
and C. Edward Chow, J. Bicknell, S. McCaughey and S. 
Syed, "A Fast Distributed Network Restoration Algorithm," 
IEEE Globecom '93, pp. 261-267, 1993, and S. Hasegawa, 
Y. Okanone, T. Egawa and H. Sakauchi, "Control Algo- 5 
rithms of SONET Integrated Self-IIealing Networks," 
Unfortunately, none of these variants overcome the funda- 
mental deficiencies of the discovery-based Grove r approach. 
Generally, simultaneous attempts by multiple nodes to dis- 
cover and reserve restoration capacity require multiple mes- lO 
sage exchanges, contention resolution and path calculation, 
and the variants are thus unable to avoid excessive restora- 
tion delays. 

There is presently no end-to-end path restoration 
approach which provides a distributed precomputation tech- ^5 
nique suitable for use in an optical network. Although a 
distributed precomputation technique is described in U.S. 
Pat. Nos. 5,435,003 and 5,537,532, both entitled "Restora- 
tion in Communications Networks" and issued to R. S. K. 
Chng, C. R Botham and M. C. Sinclair, this distributed 20 
precomputation technique is not well-suited for use in an 
optical network which includes WSCCs, WAs, WICCs or 
other typical optical routers such as those described in 
conjunction with FIGS. 1, 2, 4 and 5 above. The technique 
p recomputes alternate paths for certain failure scenarios. 25 
After a failure occurs, a pair of end nodes affected by the 
failure attempt to find alternate paths in real-time. If a 
precomputed restoration path exists, the end nodes switch 
traffic to the precomputed path while the real-time paths are 
being computed. Once the best real-time path is computed, 30 
if the end nodes determine that the real-time path is better 
than the precomputed path, they switch traffic to the real- 
time path. This technique suffers from a number of draw- 
backs which render it of limited utility in an optical network. 
For example, the technique fails to address the possibility 35 
that a path computed in real-time for one failure scenario 
may overlap with precomputed paths for another scenario. 
Also, a path computed for one demand in real time may 
overlap with a precomputed path for another demand. There 
is no procedure for resolving conflicts between demands 
contending for restoration capacity on the same link, either 
during precomputation or during real-time computation. In 
addition, the technique does not support the use of failure- 
disjoint alternate paths for situations in which fauh isolation 
is not possible. Moreover, the technique provides no 45 
resource optimization other than that of picking the best path 
among a set of ad hoc paths. These and other deficiencies of 
the approach of U.S. Pat. Nos. 5,435,003 and 5,537,532 
render it of limited value in a complex, large-scale optical 
network. The prior art thus fails to provide a distributed so 
precomputation restoration approach which provides accept- 
able performance in a* large-scale optical network. 

It is therefore apparent that a need exists for improved 
network restoration techniques which utilize distributed 
precomputation to provide path restoration in large-scale 
optical networks after link, span or node failures, while 
avoiding the problems associated with the above-described 
conventional restoration techniques. 



SUMMARY OF THE INVENTION 



60 



The present invention provides techniques for performing 
distributed precomputed restoration in a large-scale optical 
or electrical network. In accordance with a first aspect of the 
invention, a distributed precomputation algorithm is pro- 
vided which is suitable for implementation by the nodes of 65 
an all-optical network. The precomputation of primary and 
restoration paths may be performed at the source or desti- 



nation nodes of a demand in an optical network in which 
links are constrained in terms of the particular optical signal 
wavelengths which may be carried thereon, and in which 
failures cannot be isolated using an automatic failure detec- 
tion capability. These factors have heretofore prevented the 
use of conventional distributed precomputation techniques 
in optical networks. The distributed precomputation algo- 
rithms of the present invention may be used to determine a 
restoration path for a given demand under network capacity 
constraints, such that the restoration path may be used to 
satisfy the demand in the event of a failure on a primary path 
of the demand. The algorithm may also be used to determine 
only the primary path for the demand, or both the primary 
and restoration paths for the demand. Various embodiments 
of the distributed precomputation algorithm may be config- 
ured to be partially or fully asynchronous. 

In accordance with a second aspect of the invention, the 
distributed precomputation algorithm utifizes at least one 
parallelization mechanism so as to improve the efficiency of 
the algorithm. For example, the parallelization mechanism 
may provide contention resolution among one or more 
contending demands. The contention resolution may be 
based on locking the primary path associated with a given 
demand, or may be based on a set partitioning approach. 
Another parallelization mechanism may involve processing 
the demands in an order which has been predetermined to 
improve a path search operation of the distributed precom- 
putation algorithm. 

In accordance with a third aspect of the invention, the 
distributed precomputation algorithm may include a first 
phase in which link capacity is allocated to demands to the 
extent possible without violating capacity constraints, and a 
second phase in which contentions between demands for the 
same capacity are resolved. The first phase may include a 
contention locking mechanism which locks a primary path 
of a given capacity demand to prevent other demands from 
contending for the same capacity. The second phase of the 
distributed precomputation algorithm may optimize the use 
of network capacity by rerouting paths previously allocated 
to one or more demands in order to free up capacity required 
for another demand. 

In accordance with a fourth aspect of the invention, a link 
capacity control feature may be provided in a distributed 
precomputation algorithm by storing a link status table at 
one or more nodes controlling a given one of the links. The 
link status table identifies a number of specific failures and 
demands currently routed through the link which are 
affected by the failures. The table may also provide, for one 
or more of the specific failures, an indication of the spare 
capacity available on the link. 

In accordance with a fifth aspect of the invention, a hybrid 
centralized/distributed approach is provided in which certain 
information used in distributed precomputation is down- 
loaded from a central controller of the network. The down- 
loaded information is used as an input to a distributed 
precomputation algorithm implemented by the network 
nodes in order to provide improved performance. The cen- 
tral controller makes use of its knowledge of global network 
topology in computing the information to be downloaded to 
the nodes. The downloaded information generally represents 
network information which changes with less frequency 
than other network information processed by the nodes as 
part of the distributed precomputation algorithm. The infor- 
mation downloaded from the central controller may include, 
for example, network connectivity information regarding 
interconnection between the network nodes in the absence of 
failures, capacities for at least a subset of the links in the 
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network, a subset of all possible routes between a source second phase of a distributed precomputation algorithm in 

node and a destination node in the network arranged in an accordance with the invention. 

appropriate initial search order, a link contention index for piGS. 13A and 13B illustrate the operations at a source 

each of the links in the network, wherein the contention node in a search for primary and restoration paths in another 
index for a given link is indicative of a total number of paths 5 exemplary distributed precomputation algorithm in accor- 

which include the given link, and indices assigned to one or dance with the invention. 

more of the nodes and Unks of the network, wherein the pj^g jj^ operations assodated 

indices are used by the distributed precomputation algonthm ^jh the path search of HGS. 13A and 13B. 

to determme an order of processmg Operations involvmg the r> ^ / t , « 

one or more nodes and links of the network. Other types of lO '^^^''^^^ 'h^ operations of a node reoeivmg a 

centrally computed information may also be downloaded to refute request generated in the path search of FIGS. 13A 

the nodes for use in a distributed precomputation algorithm ^ 

in accordance with the invention. FIGS. 16A and 16B illustrate operations performed asyn- 

Tlie distributed precomputation techniques of the present chronously by source-destination node pairs in a search for 
invention are well-suited for use in complex optical net- 15 pnmary and restoration paths m accordance with the 

works which include wavelength select devices, wavelength "^venUon, m a case m which the same wavelengths are 

adapters, wavelength interchange devices and other types of ^^^^ ? P"°^^^ restoration routes and there is no 

optical routers. The techniques may be used to precompute ^^^"^^ of restoration capacity. 

restoration paths in a distributed manner, such that the FIGS. 17A and 17B illustrate operations performed asyn- 
restoration lime is significantly reduced. The techniques are 20 chronously by source -destination node pairs in a search for 

capable of operating in a failure-independent maimer, and primary and restoration paths in accordance with the 

can therefore accommodate situations in which failure iso- invention, in a case in which sharing of restoration capacity 

lation is not possible, while also utilizing failure isolation ^ permitted. 

information to do failure-dependent alternate routing when DETAILED DESCRIPTION OF THE 

such information is available. The techmques make more 25 INVENTION 
efiScient use of spare capacity than the conventional tech- 
niques described above, through the use of the above -noted The invention will be described below in several sections 
link capacity control mechanism and path optimization in accordance with the following outline, 
procedure. In addition, the link- level information storage 1. Exemplary Network and Node Configurations 
and inter-nodal message exchange constructs of the inven- 2, A PartiaUy Asynchronous Distributed Precomputation 
tion provide a graceful transition to a real-time discovery- Algorithm 

based approach for failure scenarios that are outside the 3 ^ p^^^ Asynchronous Distributed Precomputation 

scope of precomputed restoration. These and other features Algorithm 

and advantages of the present invention will become more . .1. ^ . r» 1 *■ n j e * n *• 

, c ^ ,u J A 4.U e u 4. Alternative Contention Resolution Based on Set Parti- 

apparent from the accompanying drawings and the follow- . . 

ing detailed description. ^ . , „ . , , „ . „ , 

5. A Distnbuted Precomputation Algonthm Using Probe 

BRIEF DESCRIPTION OF THE DRAWINGS and Request Messages 

no. 1 illustrates wavelength routing using a wavelength ^"r*'"^ CentraHzed/Distributed Precomputation 
1 , , /\i7cr-r^\ & & ^40 The invention will be illustrated in conjunction with an 

select cross-connect (WSCC). , , , 1 i. u u u j * j 

^ ^ exemplary optical network. It should be understood, 

BG. 2 illustrates one type of wavelength routing in an however, that the invention is not Umited to use with any 

optical network. particular type of network, but is instead more generally 

FIG. 3 illustrates a pair of optical links sharing a common applicable to any network in which it is desirable to provide 
span. 45 improved restoration. For example, the distributed precom- 

FIGS. 4 and 5 illustrate alternative wavelength routing putation restoration techniques may be utilized in telephone, 

techniques. cable and other electrical networks. The term "network" as 

FIG. 6 shows an optical network in which the distributed used herein is therefore intended to include not only optical 

precomputation techniques of the present invention may be networks, but also electrical networks and hybrid optical- 
implemented. 50 electrical networks. The term "distributed" refers generally 

HG. 7 is a more detailed diagram of a node in the to computation which occurs at multiple nodes of a network, 

exemplary network of FIG 6 rather than in a central controller or other single centralized 

FIG. 8 is a flow diagram illustrating the operation of a fiist "precomputed;' refers to compulation 

phase of a distributed precomputation algorithm in accor- ^1"'='' ""^""^ P™^ '° °' substanUally contemporaneously 

dance with the invention. with a link, span or node failure, as opposed to m response 

,^.^r, , n.^ r, ■• ■ ■ to au iudicatiou of thc failuK. 

FIGS. 9A and 9B are flow diagrams illustrating the restoration techniques of the present invention 

operation of a second phase of a distnbuted precomputation . substantial advantages in a number of different 

algorithm m accordance with the invention. |. ^. ^ t *u - »• ™ u j • 

* apphcations. For example, the invention may be used m an 

FIGS. lOA through lOD are flow diagrams illustrating the apphcation in which a network of given capacity is deployed 

operation of a first phase of a fully asynchronous distributed ^ ^^i^ ^nd it is desired to restore the maximum amount 

precomputation algorithm in accordance with the invention. ^^^^ ^^^^^ j^e Umitations of existing capacity in the 

FIGS. IIA through HE are flow diagrams illustrating the network. The invention is also suitable for use in applica- 

operation of a second phase of the fully asynchronous tions in which, in addition to making the best use of existing 
distributed precomputation algorithm of FIGS. lOA-lOD. 55 capacity, it is desirable to compute the minimum amount of 

FIG. 12 is a flow diagram illustrating an alternative additional capacity necessary to provide 100% restoration of 

technique based on set partitioning for implementing the traflBc in the event of single link, span or node failure, and 
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to determine the links on which this additional capacity is FIG. 7 shows one of the nodes 56-/ of network 52 in 

needed. For example, an illustrative embodiment of the greater detail. The node 56-/ includes a nodal processor 58-/ 

invention may take an aggressive approach of striving for which includes a central processing unit (CPU) with 

high survivability performance characterized by as short a memory. A set of input links 64, corresponding to fiber 

restoration time as possible, in the range of hundreds of 5 connections 62 with three other nodes, are connected to 

milliseconds, while achieving a high network efficiency. It is buffers 70-1, 70-2 and 70-3 in node 56-/. The node 56-/ 

recognized that not all applications demand a uniformly supplies signals to three other nodes via a set of output links 

rigorous restoration time of a few hundred milliseconds. ti^^J^^^V" ^^L^^t^^^-^^^^^^^^ 
Some appUcations, such as voice transmission, may be more r^^i^^Mim^^ 

tolerant than other appHcations, such as distributed comput- .^^^^^m 

. J or ' . a u i*. . 1 for Signals reccived.on mput-lmks 64, whilc the buffers 72-1, 
ing^The mvention provides sufficient flexibihty to accom- S2"2 andW may provide electrical-to -optical conversion 

modate these and other diverse network applications. transmitted on output links 66. The opera- 

For simplicity of illustration, the path restoration tech- ^^^nal link 60-/ of node 56-/ to the central controller 54 
niques will be described herein assuming directionality from includes an input operational link which is coupled to nodal 
source to destination nodes as if capacity allocation is also is processor 58-/ via an input buffer 74, and an output opera- 
directional. However, demands are generally full duplex and tional Unk which receives signals from nodal processor 58-/ 
the capacity of a given link is typically controUed by the two via an output buffer 75. The node 56-/ also includes a 
nodes at the ends of the link. The node closer to the source demand database 76 for storing demands for network 
of a capacity request usually makes the decision, but the capacity, and a set of routing tables which specify routing 
other node needs to update its information on available 20 paths through the network for particu lar demands. The 
capacity. In order to make the simplified description corre- dem and database 76 and rdutin g^abl^l^Tjmayjb^^ 
spond to full duplex capacity allocation, the following nents of a_common mc^oir^^itHin Wdg^56^/v-and may be 
description may refer to decision making links. In reality, combined with or otherwise associate'd'witHlhe memory of 
decisions are made by nodes, not links. When the following "o^lal processor 58-/. The node 56-/ has been simplified for 
description refers to a decision by a Hnk, it therefore implies 25 Purposes of illustration, and as noted above may include a 
that the node at the end of the link closest to the source of substantiaUy larger number of mput and output links, as 
message makes the decision, and the node at the other end ^^^"^^^^ ^ g^^^° apphcation. 
of this link updates its infonmation to reflect that decision. 2. A Partially Asynchronous Distributed 
When the following description refers to a message passing Precomputation Algorithm 
from one link to the next, it implies a message within the 30 piGS. 8, 9A and 9B illustrate the operation of an exem- 
node from a portion of the node controlling one link passing plary partially asynchronous distributed precomputation 
to a portion of the same node controlling the next link. It will algorithm in accordance with the invention. The algorithm is 
therefore be readily apparent to those skilled in the art that suitable for automatic restoration of network traffic in an 
the described techniques may be easily extended to cover optical network, such as that described in conjunction with 
typical full duplex applications. 35 FIGS. 6 and 7 above, in the event of single link, node or span 

failure. The algorithm provides end-to-end path restoration 

1. Exemplary Network and Node Configurations ^ distributed and asynchronous manner. The illustrative 

FIG. 6 shows an exemplary optical system 50 in which the algorithm is distributed in nature in that restoration paths are 
distributed precomputation restoration techniques of the computed and stored locally at each node, restoration path 
present invention may be implemented. ^Fhe system 50 40 routing tables are activated locally at each node, failure is 
includes an optical network 52 and a central controller 54. detected on a path by two end nodes independently, and the 
The network 52 includes a number of nodes 56-/, /"-I, path restoration does not require intervention of a central 
2, . . . N. Each of the nodes 56-/ includes a corresponding controller. The algorithm is therefore scalable and applicable 
nodal processor 58-/. The nodes 56-/ of network 52 are to networks of any size. The algorithm is asynchronous in 
interconnected by optical fiber connections 62. In this 45 that each node performs its computations independently 
example, each of the nodes 56-/ has a fiber connection to regardless of the status of the other nodes. The algorithm 
three other nodes. Each of the nodes 56-/ is also connected provides very fast restoration in that the local storage of 
to the central controller 54 via a corresponding operations precomputed restoration paths allows immediate activation 
link 60-/, shown as a dashed line in FIG. 6. The central of the restoration path in the event of a primary path failure, 
controller 54 and nodes 56-/ may each represent a main- 50 Unlike conventional distributed restoration techniques, the 
frame computer, a workstation, a microcomputer, a personal algorithm does not require any specific computation or 
computer or other suitable digital data processor pro- dissemination at the time of the failure, 
grammed to provide the path restoration techniques As noted above, failures are generally detected in all- 
described herein. It should be noted that the system of FIG. optical networks by measurements of optical signal power 
6 is simplified for purposes of illustration. The invention is 55 levels. For example, each wavelength may be independently 
well-suited for use in large-scale regional, national and monitoredand when the power level of that wave length fa Us 
international networks which may include many below a certain threshold, failure is assumed. At the optical 
subnetworks, each having hundreds of nodes. In an optical level, there is presently no suitable in-band or supervisory 
network for routing telephone calls, one or more of the signahng available, and it is therefore usually not possible to 
subnetworks may be associated with each local exchange 60 isolate the failure to a particular link, node or span. It is 
carrier (LEC) and inter-exchange carrier (IXC) of the net- generally only possible to conclude that a failure occurred 
work. The central controller 54 may or may not participate somewhere on an end-to-end path. The illustrative algorithm 
in network restoration, depending upon the restoration tech- to be described below accounts for this lack of a fault 
nique employed. For example, the hybrid approach isolation mechanism by computing restoration paths which 
described herein will utilize the central controller 54, 65 are node, link and span disjoint with the primary paths. This 
whereas the other illustrative restoration techniques provides an added advantage in that the resulting restoration 
described herein generally do not. paths are failure independent. Therefore, as soon as a failure 
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in an end-to-end path is delected, an alternate route using the will not create a deadlock. The link contention locking 

restoration path can be triggered without trying to evaluate mechanism in this implementation of the algorithm assumes 

exactly where the failure has occurred. The algorithm is also that two demands are contending if their primary paths share 

generalizable to failure dependent restoration, where fault any node, link or span. Other suitable contention locking 

isolation information is either available via signaling mecha- 5 mechanisms may also be used. After the locking message is 

nisms or is deduced from topology information propagated accepted at the other nodes along the primary path of d*y, the 

by the network. path search for demand d*y is initiated in step 85 using a 

The algorithm to be described below is separated into two distributed breadth-first search (BFS) technique. The BFS 
phases, Phase 1 and Phase 2, and includes a path search path search is a well-known process for finding a shortest 
procedure, a link capacity control procedure, a contention 10 path through a network, and will not be described in detail 
locking procedure, and an optimization procedure. Phase 1 herein. Alternative path search techniques, including depth- 
is illustrated in FIG. 8, and Phase 2 is illustrated in FIGS. 9A first search techniques, could be utilized in alternative 
and 9B. The optimization procedure is part of Phase 2 and implementations. The nodes, links and spans of the primary 
is used for demands which fail to find a restoration path in path of d^ , except for its two end nodes, are removed from 
Phase 1. It should be noted that Phase 2 optimization 15 consideration in the path search, such that any resulting 
procedure is optional, and may be eliminated in alternative restoration path will be node, link and span disjoint with the 
embodiments of the invention. The objective of the illustra- primary path as previously described. The path search is 
tive algorithm is to compute node and link disjoint paths for fully distributed in that each node has information only 
each demand in the network by making the best use of about its neighboring nodes in the network. If the BFS path 
available spare capacity. That is, the algorithm as illustrated, 20 search of step 85 results in a restoration path being found for 
given a network topology, a set of demands and their demand d*y, steps 86 and 88 indicate that the lock on the 
primary routes, and link capacities, finds restoration paths primary path is released, the restoration path information is 
for the maximum number of demands. Alternative versions stored for demand d*y, and Phase 1 of the distributed 
of the algorithm may be used to solve other network precomputation algorithm is complete for that demand. The 
problems, such as determining the minimum additional 25 P^^b information may be stored in the demand database 76 
capacity required to achieve complete restoration. It should or routing tables 77 of the node as shown in FIG. 7, or in any 
be emphasized that while the alternate path is failure other suitable memory location accessible by the nodal 
independent, the utilization of spare capacity is in fact processor 58. If step 86 indicates that no restoration path was 
failure dependent, and the illustrative algorithm may be located in the BFS path search for demand d*y, the algorithm 
configured to exploit this feature. The source node of a given 30 then releases the lock on the primary path, and waits to enter 
demand or a set of demands finds restoration paths for these Phase 2 for demand d*,y as shown in step 87. 
demands and generally does not have to worry about FIGS. 9A and 9B illustrate Phase 2 of the distributed 
demands originating elsewhere. In this sense the algorithm precomputation algorithm as implemented at node i for 
is fully distributed. As the computation progresses, it may be demand d*y. Step 90 corresponds to step 87 of FIG. 8, and 
necessary, during the optimization procedure of Phase 2, to 35 shows the demands processed by node i waiting for the start 
change a previously computed restoration path for a demand of the Phase 2 portion of the algorithm. The receipt of a 
in order to provide a restoration path for another demand. "Start of Phase 2" message causes the process to perform 
The illustrative algorithm assumes that the network does not steps 92 through 100, which operate in manner similar to 
include wavelength interchange devices, such as those that described above for steps 80 through 85 of FIG. 8. If the 
described in conjunction with FIG. 5 above, but alternative 40 BFS path search of step 100 results in a restoration path for 
versions suitable for use with wavelength interchange demand d*y, step 102 indicates that the path information is 
devices will be apparent to those skilled in the art. stored, and the lock placed on the primary path of demand 

FIG. 8 shows a flow diagram of Phase 1 of the algorithm d*y in step 94 is removed. If step 104 indicates that a 

as implemented at a particular node i of the network. The restoration path is not found for demand d*y, a set of 

notation d*y will be used to denote the kth demand origi- 45 "bottleneck'* links with no available spare capacity (also 

nating at node i and terminating at node j. An independent referred to as a "cut" set) is identified as a side product of the 

path search is performed for each demand for which a path search procedure, and a contending demand d„ utilizing 

restoration path is to be computed at node i. In step 80, a these links is selected for negotiation, as shown in step 106. 

demand d*y which does not yet have a restoration path is A second type of locking message, referred to as a "Type 2" 

selected for processing at node i. Before the path search 50 locking message in order to distinguish it from the locking 

procedure is started for demand d*y, a locking message is messages used in step 82 of FIG. 8 and step 94 of FIG. 9A, 

sent along the primary path of d*y in order to lock out other is then sent along the primary path of the demand d„ in order 

demands contending for the same spare capacity, as shown to lock out other demands contending for the same spare 

in step 82. The reason for locking d*y's primary path is that capacity, as shown in step 108. If the locking is determined 

only demands whose primary paths share a node or link with 55 to be successful in step 110, the algorithm proceeds to FIG. 

d* -^s primary path may be disrupted by a single failure at the 9B. If the locking was not successful, which may be the case 

same time as d^., and therefore may require capacity to if other demands already have a restoration computation 

activate their restoration paths at the same time as d*y. As a underway at another node, the algorithm wails for a random 

result, only these demands contend with d*,y for network amount of time in step 112, and then returns to step 108 to 

capacity. A determination is made in step 83 as to whether 60 send the Type 2 locking message again, 

or not the locking was successful. If the locking was not In step 114 of FIG. 9B, the restoration path for demand d„ 

successful, which may be the case if other demands already is taken out of the set of allocated paths and the capacity the 

have a restoration compulation underway at another node, d„ restoration path consumed is released. A BFS path search 

the algorithm waits for a random amount of time in step 84, is again initiated for d*,y using only links with available 

and then returns to step 82 to send the locking message 65 capacity, as shown in step 115. If step 116 indicates that no 

again. Contention locking ensures that two contending path is found for d*y, the rerouting of dn is unsuccessful, 

demands performing path search operations at the same lime Step 117 indicates that the process then returns to step 106 
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of FIG. 9 A, after releasing the lock on the current d„, in before it knows whether it can use a link. To do so, source 

order to identify another demand d„ for negotiation. If step node in sends the list of failures that affect d^'^^^s primary 

116 indicates that a path is found for demand d*y, a BFS path path. Since at this time each link has available spare 

search is initiated for d„ using only links with available capacity, the first query of each link always returns a "yes" 

capacity, as shown in step 118. If step 119 indicates that a 5 answer. After a restoration path is computed by the path 

path is also found for d„, then the rerouting of d„ is search procedure, a request is sent to reserve one unit of 

successful. The locks on d ,y and d„ are then released in step bandwidth on each link on the restoration path. Each link on 

120, the new paths for d*;,- and d are stored in step 121, and the end-to-end path grants the reservation and updates its 

the algorithm ends for demand d ^ . If step 119 indicates that link status table. Suppose that at some later time t, the status 

no path is found for d„, the restoration path ofd^j is removed 10 of Unk mn is as shown in the above link status table, and 

and the capacity it consumed is released, as shown in step another node g is computing a restoration path for a demand 

122, A cut set of links for d„ was identified as a side product <j"^^. xhe node g then sends a query to the links involved in 

of the failed path search of step 118. In step 123, d^. is ihe path search. Suppose g wants to know whether link mn 

rerouted using a shortest path search which avoids links in has capacity for demand d"^;,. Assume that d"^^ is affected 

the cut set as much as possible by assigning high costs to ^5 by failure f^. When this query is received a1 node m, it 

those links. After rerouting d^- in this manner, a BFS path indexes the mn(t) link status table using the failure and 

search is initiated for d„ in step 124, using only links with determines that the available spare capacity is already 0 and 

available capacity. If step 125 indicates that a path is not that the demands contending for this spare capacity are d''^^,, 

found for d„, the rerouting of d„ was unsuccessful. The a^^^ and as shown in the table. A "no" answer together 

process then returns to step 106 of FIG, 9 A, after releasing 20 with this information is sent back by node ra to node g. 

the lock on the current d„, in order to identify another ^ described above, all of the routing and capacity 

demand d„ for negotiation. If step 125 mdicates that a path computations proceed in a distributed manner in the algo- 

is found for d„, the rerouting of d„ is successful. The locks ^.^jhm of FIGS. 8, 9A and 9B. Each node performs routing 

on (fij and d„ are then released in step 120, the new paths for computations only for the demands originating at that node 

d ^. and d are stored in step 121, and the algorithm ends for 25 except while resolving contentions. Each node maintains 

demand d y. Asimilar process may be implemented for other ^^^k status tables only for the links incident on it. The entries 

demands originating at node i. It is assumed for simplicity ^ status tables correspond only to failures that 

m the subsequent descriptions that the node 1 acts as the necessitate the use of a particular Unk on the restoration path 

master in determmmg the route for the demand d ^j. ^ specific demand. ITie optimization procedure of Phase 

The algorithm illustrated in conjunction with FIGS. 8, 9A 30 2 is utilized only if the current demand is blocked and 

and 9B assumes without limitation that every node in the therefore unable to find a restoration path. The algorithm 

network maintains information on available spare capacity j^ay be utilized to compute restoration paths in the event of 

for each link incident on that node. Since available spare multiple link, span or node failures. For example, if a 

capacity depends on the failure, spare capacity information computed restoration path is affected by a multiple failure, 

is maintained on a failure-specific basis. In addition, infor- 35 the algorithm may be configured to permit a fall-back to the 

mation is also maintained on the specific demands that are ^se of end-to-end capacity discovery at the time of failure, 

contending for the available spare capacity. An example of using actual spare capacities available at the time of failure, 

a link status table representing this information at node m on ^ith contention resolved by a random backup technique, 

a link mn between nodes m and n at time t, denoted link Switching back to a primary path once the failure affecting 

mn(t), is shown below. The link status table is updated 40 the primary path is repaired is simplified by the fact that the 

continuously throughout the course of the distributed pre- algorithm does not utilize the capacities of the primary paths 

computation. For example, the table may be updated when- in generating the restoration paths, ^fhis is another advantage 

ever a new demand enters the link or an existing demand of the use of node and link disjoint restoration paths. The 

exits the link. The link status table may be maintained by a algorithm is also seamless for incremental changes to the 

single node at one end of the corresponding link, or by two 45 network. For example, the algorithm may be run continu- 

nodes at either end of the link with one of the nodes acting Q^sly in an incremental mode, in order to accommodate 

as a master and the other as a slave. The "Failure ID" newly provisioned demands, demands affected by capacity 

identifies a specific failure, while the "Demands" column changes in the network, or other demands which have not 

indicates which demands are affected by a particular failure. ygt found a valid restoration path. There is no need to restart 

The "Available Spare Capacity" indicates the amount of 50 the algorithm from scratch to accommodate these and other 

capacity which would be hypothetically available after the changes in demands and capacity. The algorithm may be 

occurrence of a particular failure. restarted from scratch after major provisioning or network 

capacity changes, or on a periodic but infrequent basis. 

55 3. A Fully Asynchronous Distributed 
Precomputation Algorithm 

FIGS. lOA-lOD and IIA-IIE show flow diagrams illus- 
trating the operation of another exemplary distributed pre- 
computation restoration algorithm in accordance with the 

60 invention. The algorithm described in this section is fiilly 
asynchronous, in that there is no coordination required in the 

As an illustration of the use of the above link status table, relative timing of the restoration path computation processes 

assume that the distributed precomputation algorithm starts carried out at different nodes. The lockout mechanism used 

at a time t=0 and that at this time the link status tables of all in the partially asynchronous algorithm of the previous 

of the links are empty. When source node m computes a link 65 section is therefore not needed. Again, d*y will represent the 

and node disjoint restoration path for a demand d^^ using kth demand between a source node i and a destination node 

spare network capacity, it has to query the link status table j in the network. It will be apparent that the fully asynchro- 
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nous restoration algorithm also provides many of the advan- be set to a reasonable maximum, such as ten. Even if the sum 

tages of the partially asynchronous algorithm described in of the round trip delay and processing time, with the 

the previous section. The fully asynchronous algorithm is above-noted hop constraint, is equal to about 300 msecs, the 

also divided into two phases. Phase 1 and Phase 2, with total time required to perform the steps of FIG. lOA for a 

Phase 1 illustrated in FIGS. lOA-lOD and Phase 2 illus- 5 large number of source nodes can be less than a minute. The 

trated in FIGS. IIA-IIE. Phase 1 generally provides initial source nodes that are successful on the first try will wait in 

capacity allocation for a set of demands in the network, step 134 for a period equal to the time needed for maximum 

while Phase 2 is a contention resolution phase which pro- retries plus a safety factor, such as one minute, to ensure that 

vides optimization of network capacity utilization. all demands have completed Phase 1, and will then enter 

FIG. lOA shows the Phase 1 process ste ps at the source Phase 2. Similarly, the source nodes that are successful in the 

node i for demand d*y. In 5t^l9.Ql|tfi^ngs!|Sg ags and second or third try will prorate their waiting time in step 134 

spansi0f ^t he|p ^^ M^^gEr^^ so that the total time before entering Phase 2 is less than 

er ationBnj t^ir^^ about one minute. 

operktiSnfisl^rSrmSS'^^ FIG. lOB shows the process steps of Phase 1 at the 

tgghmgug^The BFS path search may utilize a flooding 15 destination node j of demand d*^. In step 140, node j 

mechanism, in which a given source node simultaneously receives the first request from node i for demand d*y. Node 

attempts to find available capacity on multiple paths. As part j then starts a timer and waits for a random amount of time 

of the BFS path search, the source node i sends capacity as shown in step 142. After expiration of the random time 

allocation requests to the destination node j to discover and period, the process in step 144 determines whether at least 

reserve capacity. In the source-to-destination direction, each 20 P^^** ^^^^ available capacity has been received. If no 

intermediate node with available spare capacity allocates path with available capacity has been received after the 

and reserves capacity in accordance with the request. The expiration of the waiting time, the destination node informs 

information sent by the source node in a given request the source node that the path is unavailable, as shown in step 

message may include a demand identification (ID) for 146. If at least one path with available capacity is received 

demand d*^ and a corresponding failure set F*,y for the 25 in step 144, the destination node j in step 148 selects the best 

demand d*y. F*,;^- is the set of failures that can affect the of the paths with available capacity, and sends messages 

primary path of d*.^, and generally provides more informa- requesting the release of capacity on the remaining paths, 

tion that the primary path itself. Only those intermediate The best path may be the shortest path of the paths with 

nodes that have links that are unaffected by F*',^ will there- available capacity, the path with the greatest available 

fore reserve capacity, such that the resulting restoration path 30 capacity, or a path selected based on any other reasonable 

will be node and link disjoint with the corresponding pri- path selection criterion. The destination node j then informs 

mary path of d*y. In order to avoid excessively long paths in the source node i of the selected path, 

the BFS path search, a hop constraint may be incorporated FIG, IOC illustrates the Phase 1 process steps at nodes 

into the search. which perform link capacity control operations. It will be 

For the demand d^., the destination node j may receive 35 assumed that these nodes include a node m which performs 

multiple messages indicating that capacity has been alio- capacity control for a link mn. The node m receives a request 

cated on muhiple paths for d*y. The received messages also from node i for capacity for a demand d^.. In step 150, the 

indicate the end-to-end paths on which this capacity has node m performs a link capacity control check to determine 

been reserved. It will be assumed without limitation that for if link mn has suflBcient available capacity to accommodate 

each demand d*^-, there is some function of i,j and k which 40 demand d*y. If suEBcient capacity is not available in step 

decides whether i or j acts as the source for this specific 152, node m in step 154 sends a message to the next node 

demand. The destination nodej then sends return messages in the BFS path search indicating that link mn does not have 

back on the end-to-end paths indicated in the messages sufficient capacity to accommodate demand d*,y. If step 152 

received from the source node. The return messages specify indicates that sufiScient capacity is available on link mn, 

that capacity has been reserved on a selected one of the 45 node m in step 156 reserves the capacity for demand d*y, 

paths, and requests release of capacity on all the other updates the link status table for link mn, and sends a message 

possible alternate paths. The BFS path search may be to the next node in the BFS path search to the effect that 

modified such that each source node attempts to find capac- capacity has been reserved for demand d^ on link mn. 

ity on only a single path as opposed to attempting to find FIG. lOD illustrates additional Phase 1 process steps at 

capacity on a multiplicity of paths via flooding. 50 nodes which perform link capacity control operations. As 

In step 132, a determination is made as to whether the noted in conjunction with FIG. lOB above, the destination 

destination node j has sent an available restoration path with node j for demand d^,y selects the best path of those paths 

the capacity needed for demand d\-. If the needed path has having available capacity, and sends messages to request 

been sent, the process waits for the beginning of Phase 2 as release of capacity on the unselected paths. In FIG. lOD, a 

shown in step 134. If the needed path has not been sent, a 55 request for release of capacity is received from node j in a 

random backoff timer is started such that the process waits node m which had previously reserved capacity for demand 

for a random amount of time, as shown in step 136, before d*y as described in conjunction with FIG. IOC above. In step 

retuming'to the path search step 130. While it is possible that 158, the node m releases the capacity previously reserved for 

one specific demand geu reservation on multiple paths on its demand d*,y, and updates the corresponding link status 

first try, other demands may not have found even one 60 control table. The node m then acknowledges the release of 

end-to-end path with spare capacity. Therefore, source nodes capacity to the destination node j. 

that do not hear from the destination node within the random At the end of Phase 1, any source node which has found 

backoff time will try again, as indicated by steps 130, 132 a restoration path for at least a subset of its demands has the 

and 136 of FIG. lOA. The backoff timer may be set, for restoration path stored locally at the node. In addition, a fink 

example, by adding a random value with known mean and 65 status table in the format shown below is maintained for 

distribution to the sum of the round trip delay and the node each link. As noted previously, the link status table may be 

processing times. Moreover, the total number of retries can maintained by a single node at either end of the correspond- 
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ing link, or by both nodes in accordance with a master/slave 
arrangement. 
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The "Link Status" column of the above table can lake on one 
of four values. Link Status 0 indicates that spare capacity is 
available for a specific failure and that this link is ready to 
allocate and reserve capacity for a requesting demand. Link ^5 
status 1 indicates that this link is currently participating in 
contention resolution with another demand and therefore can 
neither allocate capacity nor can it allow the requesting 
demand to enter contention resolution. Link status 2 indi- 
cates that the link has no spare capacity but it can allow the 
requesting demand to enter contention resolution. Link 
status 3 indicates that the link has no spare capacity and also 
that it will not entertain any requests from demands to enter 
the contention resolution state. The link status table is 
continuously updated as it receives requests for capacity 
from demands. 25 

FIG. 11 A shows the Phase 2 operations of the fiilly 
asynchronous distributed precomputation algorithm at a 
source node i. Phase 2 begins for a particular demand d*y 
after the wait period in step 134 of FIG. lOA expires for that 
demand. The demands originating at node i thus enter Phase 30 
2 asynchronously. It will be assumed for this illustration that 
source node i was unable to find a restoration path for 
demand d*y in Phase 1 of the algorithm. It will also be 
assumed that node i and all the other nodes in the network 
have network topology information stored locally. This 35 
topology information may be acquired as a byproduct of the 
flooding mechanism associated with the above-described 
BFS path search of Phase 1. In step 160 of FIG. IIA, source 
node i removes the links, nodes and spans of the d*y primary 
path from consideration, and initiates a path search for the 40 
required capacity using the BFS technique. 

As part of the path search of step 160, source node i uses 
its locally-stored network topology information to send 
capacity requests to nodes in one or more end-to-end paths. 
The requests may contain identifiers of the source and 45 
destination nodes, an identifier of demand d*y, and a failure 
set for the primary path of demand d*y. The requests may 
also list the nodes on the corresponding proposed end-to-end 
path. In the forward direction from the source node to the 
destination node, each intermediate node reserves available 50 
spare capacity in accordance with the request and adds its 
link status to the message forwarded to the next node in the 
BFS path search. As will be described in greater detail 
below, the destination node sends the message back to the 
source node on the same end-to-end path. During transit of 55 
the capacity request messages in the forward and reverse 
directions, if an intermediate node observes that a previous 
or a subsequent link, respectively, has marked its link status 
as 1 or 3, then the intermediate node releases capacity it may 
have reserved for d*y. If an intermediate node observes that 60 
a previous link has listed its status as 0 or 2, then each node 
in the reverse direction adds additional information. The 
additional information added by a node with link status 2 
relates to contending demands. Such a node appends to the 
message a failure list and the corresponding contending 65 
demands. Source node i eventually receives this 
information, and uses it to resolve the contending demands. 
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In' step 162 of FIG. IIA, the destination node j sends 
information regarding bottleneck links and contending 
demands for a candidate end-to-end restoration path to the 
source node i by appending this information to the message 
reflected back to node i. The node i then requests permission 
to resolve contention from one or more contending 
demands, as shown in step 164. In step 166, a determination 
is made as to whether contention resolution permission has 
been received from all contending demands for the given 
restoration path. If permission has not been received from all 
contending demands, step 167 indicates that the source node 
i releases capacity on the candidate restoration path, starts a 
random backoff timer, and returns to step 160 after waiting 
for the expiration of the backoff timer. If contention reso- 
lution permission is received firom all of the contending 
demands in step 166, step 168 indicates that the algorithm 
then begins a contention resolution process, to be described 
in greater detail below. 

FIG. IIB shows the operations performed at the destina- 
tion node j of demand d*y during Phase 2 of the fully 
asynchronous algorithm. In step 170, the destination node j 
receives a first capacity request message from source node 
i for demand d*y along with information regarding bottle- 
neck links and contending demands. Node j then starts a 
random backoff timer, and waits for the expiration of a 
random amount of time, as shown in step 172. After expi- 
ration of the random time period, a determination is made in 
step 174 as to whether or not more than one capacity request 
has been received from source node i along with information 
on bottleneck links and contending demands. If step 174 
indicates that only one capacity request has been received, 
destination node in step 176 informs source node i of the 
corresponding bottleneck links and contending demands by 
placing this information in the message reflected back along 
the candidate end-to-end path to node i. If step 174 indicates 
that node j has received multiple capacity requests from 
source node i, such that there are multiple candidate end- 
to-end restoration paths, node j selects the candidate path 
having the least number of bottleneck links. Node j then 
sends messages requesting release of the capacity on the 
remaining candidate paths, and informs the source node i of 
the selected path and its corresponding botdeneck links and 
contending demands. 

FIG. lie shows the operations at nodes performing fink 
capacity control. The link capacity control is initiated when 
a given node receives a capacity request from source node 
i for demand d*y. In step 180, the given node performs a link 
capacity check. Step 182 indicates that if capacity is avail- 
able on the link controUed by the given node, the node 
reserves capacity on the link for demand d*y updates the 
corresponding link status table, and sends a message to the 
next node in the BFS path search, as shown in step 184. If 
the link does not have sufficient available capacity to accom- 
modate demand d*y, step 186 indicates that a determination 
is then made as to whether or not the link is akeady in 
contention for the specific failure associated with demand 
d*y. If the link is not already in contention for the specific 
failure, the contention status of the link is updated, and 
information on the contending demands is sent to the next 
node in the BFS path search, as shown in step 188. If the link 
is ah-eady in contention for the specific failure, step 190 
indicates that the node sends the link contention status and 
a "Try Later" message to the next node in the BFS. The Try 
Later message will indicate to source, destination and other 
intermediate nodes that the link may be removed from 
contention at a subsequent time, such that it might eventu- 
ally be used to supply capacity for the demand. 
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FIG. IID illustrates the response of a given node per- 
forming link capacity control to a request for release of 
capacity or a request for demand contention status from 
another node. In this illustration, a request for the release of 
capacity or contention status regarding demand d*y is 5 
received in the given node. In step 192, the given node 
responds to the request by either releasing capacity or 
updating the contention status of the link. If the given node 
releases capacity, it acknowledges the release of capacity to 
the requesting node. 30 

FIG, HE shows the process steps performed at a given 
node 1 which computes a restoration path for a demand d''/„. 
The node 1 receives a request for permission to resolve 
contention from source node i for demand d*y. In step 194, 
the node 1 determines whether or not the demand is is 
already under contention resolution. If demand d'',„ is not 
already under contention resolution, step 196 indicates that 
node 1 updates the contention status of demand d'',„ and 
sends contention resolution permission to node i. If demand 
d'';„ is already under contention resolution, step 198 indi- 20 
cates that node 1 denies contention resolution permission in 
an appropriate message to node i. 

An example will now be given to further illustrate the 
operation of the Phase 2 portion of the fully asynchronous 
algorithm. Suppose that source node i receives a link status 25 
of 2 for one specific link pq and that the contending demands 
for a specific fault f/ are d^^^ and d^^^ Then source node i 
sends a message to node a asking for permission to resolve 
contention on link pq between demand d*y and d^^^. It 
should be noted that each source node also mamtains a status 30 
for each of its demands. The demand status variable can take 
on one of three values. Demand status 0 indicates that the 
node gives its permission to another demand to resolve 
contention. Demand status 1 indicates that the node has 
already given permission to another demand to resolve 35 
contention. Demand status 2 indicates that the demand is no 
longer a candidate for resolving contentions with any other 
demand. 

Assume that the status of demand d^„^ is 0 when it 
receives a request from d*y. Then node a sends the status 40 
information as well as the fault set affecting the primary path 
of d^^^ and the alternate path of d^^^. Upon receiving this 
information, node i tries to resolve contention between d^^^ 
and d^^. Next, suppose the status of d^^^, is 1 when it 
receives a request from d*y. Then node a sends this status to 45 
node i. Then node i sends a request message to node e to try 
to resolve contention between d*y and d^^y. If the status of 
d^^y is also 1 then node i releases all the resources on the 
possible alternate path and tries either a different path 
immediately or the same path after the expiration of a 50 
reattempt timer. The distinction between demand status 1 
and 2 is that in the former case, a requesting demand can try 
later to see if it could resolve contention while in the latter 
case, it would not try in the future to resolve contention with 
this particular demand. The reattempt timer value can be 55 
made adaptive by having larger values in the beginning and 
decreasing them over time as a function of the link status 
indicators as well as the demand status indicators. 

In the above -described fully asynchronous algorithm, 
control over the capacity of a given link is maintained by the 60 
nodes at either ends of the link. Control over routing for a 
specific demand and permission to resolve contention is 
maintained by the source node associated with the demand. 
Furthermore, there is no need to maintain queues in order to 
prevent the possibility of deadlocks. ITie above-noted reat- 65 
tempt timers ensure that the contention resolution process is 
fully asynchronous. It should be noted that the timers in both 



Phase 1 and Phase 2 of the algorithm may be made adaptive 
such that their values decrease as computation progresses. 

4. Alternative Contention Resolution Based on Set 
Partitioning 

The contention resolution features of the distributed pre- 
computation algorithms described in the previous two sec- 
tions may be implemented using an alternative technique 
based on set partitioning. FIG. 12 shows a flow diagram 
illustrating the operation of the alternative contention reso- 
lution technique. The technique may be viewed as a replace- 
ment for the Phase 2 operations of the algorithms previously 
described, and generally assumes that contention-free res- 
toration paths have already been computed using the appro- 
priate Phase 1 operations. The technique attempts to mini- 
mize the need for deadlock resolution such that contention 
resolution may be achieved with the exchange of fewer 
messages and in less time than in the previous algorithms. 
However, the technique generally does not have the advan- 
tage of being fully asynchronous. 

The set partitioning contention resolution technique will 
be described using the following notation: 

d*y The k''' wavelength demand from source node i to 
destination node j. 

F(d*^) Set of all faults that affect demand d*y on its 
primary path. 

R Set of all demands for which restoration paths were 
found in Phase 1. 

UR Set of all demands for which restoration paths were 
not found in Phase 1 because of collisions. 

C(d^., P,) For d*y belonging to UR, C(d^-, P,) is the set 
of all routed demands in Phase 1 that d collides with 
on path PXd^y) belonging to P(d*,y), 

P(d*y) Set of all failure disjoint paths for demand d*y. 

PXd*,y) A particular element of the set P(d*y). 

CI(PXd*,y)) For d*^- belonging to UR, CI(PXd*y)) is the 
collision index for demand d*y on path PXd*y). The 
collision index of a path is a function of the number of 
links of that path on which a given demand collides 
with a routed demand, and the number of routed 
demands it collides with on each link. 

CI(d*y) Collision index of a demand d*y belonging to UR 
is the minimum of CI(PXd*,y)) over all paths in P(d*y). 

D(d*y) Set of all demands in UR whose fault sets are 
disjoint from F(d*y). 

II(d^.) For a demand d*,y in UR, II(d*^) is the set of all 
demands that can initiate contention resolution in par- 
allel with d*y. 

D(S) For any set S^UR, D(S) is the set of all demands 
in UR whose fault sets are disjoint from the fault set 
F(d) for every d belonging to S. 

NR Set of all demands for which no reroute can be found. 

It will be assumed without limitation that for every 
demand d*y belonging to UR, all of the quantities listed 
above are available from Phase 1 using one of the distributed 
computation algorithms described in the previous two sec- 
tions. For simplicity of illustration, the alternative conten- 
tion resolution technique will be described in detail below 
for a case in which no more than two demands are permitted 
to activate contention resolution in parallel. Extension to 
cases in which a substantially larger number of demands 
may be activated in parallel is straightforward, and will be 
readily apparent to those skilled in the art. 

FIG. 12 shows the steps of an exemplary implementation 
of the alternative contention resolution technique. In step 
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200, the demands d*y in the above-defined set UR are sorted 
in increasing order of collision index CI(d^-). The demand 
with the minimum collision index is the first demand on the 
sorted list of demands and is denoted d^. The path having the 
minimum collision index CI(PXd,)) for demand d^ is 5 
denoted Pj. In step 202, the disjoint set D(dj) is searched to 
determine if a demand 62 exists such that, for every d,^C(di, 
Pj) and dy€C(d2, PJ* where P2 is the path having minimum 
collision index C\(P2id^) for demand F(d,) is disjoint 
from F(d.), T^{d^) is disjoint from F(dj), and Vld^ is disjoint 
from F(d,). In step 204, a determmation is made as to 
whether or not the search for demand d^ in step 202 was 
successful. If a demand was located in the search of step 
202, step 206 indicates that the set II(d;i) of all demands that 
can initiate contention resolution in parallel is the set {d^, 
dj}. If no demand d^ was located in the search of step 202, 
step 208 indicates that the set Il(di) is the set {d^}. In either 
case, the process continues with step 210, in which a 
one-step contention resolution is initiated for all demands in 
the set ll(di). 

The one-step contention resolution of step 210 generally 20 
involves first attempting to reroute one or more demands in 
II(di), and then using the capacity made available by the 
rerouting to determine a restoration path for d^. A one-step 
contention resolution process, as applied to contending 
demands dj and dj where demand 62 is to be rerouted to 25 
avoid the contention, will only reroute d2 if dj has no 
contention on its proposed reroute path. If dj encounters 
contention on the proposed reroute path, then the contention 
resolution process between dand d2 will be terminated and 
dj will attempt contention resolution with another demand in 30 
II(di). Other suitable contention resolution mechanisms may 
also be used in place of the one-step resolution of step 210. 
Step 212 indicates that each demand d^- in U(d^) that is 
successfully rerouted in the contention resolution step 210 is 
then moved from the UR set to the R set, and the values of 35 
C(d*,y, P,), Cl(PXd\)) and CI(d^ ) are updated accordingly. 
Each demand dy in Il(di) that could not be rerouted in step 
210 is moved to the set NR. Step 214 indicates that steps 200 
through 212 are repeated until the set UR of demands 
without restoration paths becomes empty, at which time the 40 
contention resolution process ends. 

As an example of the operation of the FIG. 12 contention 
resolution process, assume there exists a demand d* in the 
set D(d*y) such that the set C(d*, P*) of demands that 
contend with d* on path P* is a subset of D(d*y). Under the 45 
above-referenced one-step contention resolution of step 210, 
both d*y and d* can simultaneously launch their contention- 
resolution mechanisms without any possibility of colliding. 
This is because the FIG. 12 process ensures that neither d* 
nor any of its contending demands C(d*, P*) have any fault 50 
in common with F(d*^y) and therefore neither d* nor any 
demand in C(d*, P*) can contend for capacity with d*y. 
However, it is possible that either d* or some demand in 
C(d*, P*) may contend for capacity with some demand in 
C(d*y, P^) as these demands may share a common fault. If 55 
that happens, the one-step contention resolution will disal- 
low both contention-resolution processes, between d*y and 
the demand in C(d*jy, P^) and between d* and the demand in 
C(d*, P*), and d*y and d* may then both resume contention 
resolution with other contending demands, if such other 60 
demands exist. If there is another demand d** in the set 
D(d*y) such that neither d** nor any demand in C(d**, P**) 
have any faults in common with the fault set of d* or any 
demand in C(d*, P*), then d*y, d* and d** can all perform 
one-step contention resolution in parallel. 65 

As noted above, the alternative contention resolution 
technique of FIG. 12 assumes the completion of a Phase 1 
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restoration path determination for each non-contending 
demand. Every contending demand d of set UR is then 
ordered in the manner described in conjunction with step 
200. The set C(d, P) of all demands that demand d contends 
with on path P is thus also assumed to be available. The 
maximum number of simultaneous demands d, d*, d**, . . . 
that can launch one-step contention resolution is then found 
in the manner described in conjunction with steps 202 
through 208. This part of the technique may be implemented 
in a centralized or distributed manner. If a distributed 
implementation is desired, then a common ordering of all 
contending demands is used, as per step 200, so that all 
source-destination nodes find the same set of demands d, d*, 
d**, ... for simultaneous contention resolution. 

5. Distributed Precomputation Algorithms Using 
Probe and Request Messages 

A number of other exemplary distributed precomputation 
algorithms in accordance with the invention will now be 
described in conjunction with the flow diagrams of FIGS. 
13 A through 17B. The algorithms will be described in the 
context of computing primary and restoration paths for a 
single new demand between a source node i and a destina- 
tion node j in the presence of existing demands which have 
already been routed. The first exemplary algorithm, illus- 
trated in FIGS. 13A-13B, 14A-14D and 15, computes two 
link-span-node disjoint paths for each demand, one of which 
is used for a primary path and the other for a restoration path, 
and therefore does not permit sharing of restoration capacity. 
The second exemplary algorithm, illustrated in FIGS. 16 A 
and 16B, computes primary and restoration paths for all 
demands between all source and destination nodes in a 
distributed manner, with the same wavelength required on 
primary and restoration paths, and no sharing of restoration 
capacity. The third exemplary algorithm, illustrated in FIGS. 
17A and 17B, computes primary and restoration paths for all 
demands, but permits sharing of restoration capacity 
between different demands not affected by any single failure. 

The description of the algorithms will make use of the 
following notation: 

V Set of nodes in the network. Each node may generate 
a demand to any other node in the network. Each 
demand is for 1 wavelength but many demands may be 
generated between the same source destination pair. 
L Set of links in the network, providing connectivity 
information. Each link connects two nodes in the 
network directly, 
(p, q) (1) For each link in L, (p, q) (1) denotes the nodes 

at the ends of a link 1 in set L. 
ip^ Link in set L connecting p and q. 
C(p, q), C(l) Capacity of the link 1 connectingp and q in 
terms of number of optical fibers. For some 
applications, C(p, q) is given for each node pair (p, q). 
For others, the only information available is whether 
C(p, q) is required to be zero, i.e. no connectivity, or is 
permitted to be greater than zero. 
N Number of wavelengths per optical fiber. 
R(i, j) For a pair (i, j) of nodes, R(i, j) denotes the set of 
allowable routes between i and j. Note that all links in 
the path of a route r in R(i, j) must either have or be 
allowed to have capacity greater than zero. 
TD(r) For every route r in R(i, j), TD(r) is the set of routes 
in R(i, j) which are node-span-link disjoint with respect 
to r. It will be assumed without limitation that in the 
following description, every r has a non-null TD(r). 
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RP(i, j) A set defined as {(r^): reR(i, j), seR(i, j), seTD ity between different demands. Like the capacity for the 

(r)}. primary route, the capacity for the restoration path is dedi- 

R Set of all allowable routes. That is, R»{R(i, j) i, j eV}. cated to a particular demand. Although this represents a 

G(r, X) For every route r in R(i, j) and a wavelength X, G(r, relatively simple procedure for providing primary and res- 

X) is an index given to the route r for wavelength X to 5 toration paths, it is also relatively expensive from a capacity 

facilitate comparison with other routes in set R(i j) for perspective. Since the restoration capacity is not shared 

that wavelength. For example, the implication may be among demands and failures, the capacity requirements are 

that, a priori, a lower indexed route is favored over a likely to be higher in this embodiment, 

higher indexed route. These indices are either supplied However, a number of factors make this embodiment 

through a central controller or calculated in a distrib- jq attractive for certain apphcations. First, in optical network- 

uted fashion using algorithmic features described applications without wavelength interchange, failure 

below. It is also possible to start with information isolation is generally difficult. Therefore, more complex 

suppUed by the central controUer and modify the indi- algorithms using more sophisticated state and failure set 

ces as distributed information exchanges provide new information may be limited in their effectiveness, and 

information. If neither is available, the route length in 35 improvements in terms of capacity requirements may not 

number of hops may be used to index the routes. justify the additional complexity. Second, since the two 

G(r, X, s, ^) For every pair (r, s) in RP(i, j) and a pair of failure disjoint paths are selected simultaneously in this 
wavelengths K and G(r, X, s, ju) is the index for the embodiment, provisioning and restoration planning may 
pairs (r, I) and (s,/i) for primary and restoration routes both be done in a distributed manner, substantially in real 
between nodes i andj. A route having a higher index is 20 time. With the use of a more complex algorithm, restoration 
preferred. Desired characteristics of the index include path computations may take longer and thus may delay 
the following: (1) a route with more free capacity yields provisioning or leave a demand without a restoration path 
a higher index; (2) a route with a uniformly high for a significant period of time. Third, while primary and 
capacity across links yields a higher index; and (3) a restoration paths are selected simultaneously in this 
route with fewer links yields a higher index. The index 25 embodiment, only the primary path needs to be fixed. ITie 
may correspond to: (1) the minimum, across all links on restoration path, since it is not in use in the absence of 
a given pair of routes, of free capacity for selected failure, can be changed to make better use of capacity while 
wavelengths; (2) the index in (1) divided by the total still satisfying future demands. Finally, as a default, this 
number of links in the two routes; (3) a constant a embodiment does not allow sharing of a transmission 
muhiplied by the minimum free capacity on a primary 30 resource, such as a wavelength in a given fiber on a given 
route plus a constant p multiplied by the minimum free fink. However, this fact is used only by the nodes at the two 
capacity on a restoration path; or (4) a multiplied by ends of each link in calculating remaining capacity. These 
the minimum free capacity on the primary divided by calculations can be modified in an alternative embodiment to 
the number of links in the primary, plus p multiplied by allow sharing and thus to free up additional capacity. The 
the minimum free capacity on the restoration divided 35 information needed to permit this can be communicated 
by the number of links in the restoration. The implica- after the primary and restoration paths are set up so as not 
tions for the index G(r, X, s, ^) are thus similar to those to delay provisioning. As will be described in more detail in 
given for G(r, X) above except that the pairs are conjunction with HGS. 17A and 17B below, such an alter- 
generally more useful when both primary and restora- native embodiment allows fiber utilization close to more 
tion routes are determined for a demand at the time of 40 sophisticated algorithms using additional information 
provisioning, while the single route indices are gener- exchange, and can therefore provide a simpler implementa- 
ally more useful in deciding the restoration route given tion in the shared restoration capacity case, 
the primary route. It should be noted that the wave- FIGS. 13A and 13B show the process steps at source node 
length is required to be the same for the case when the i in the computation of primary and restoration paths for 
primary and restoration routes are required to use the 45 demands d*y from source node i to source node j. As noted 
same wavelengths. Again, in absence of any other above, both primary and restoration paths for a given 
information, the lengths of the routes in number of hops demand are computed at the same time in this embodiment, 
can be used ro provide an additional indexing of the The algorithm is applicable to situations in which the 
pairs. Information subsequently generated by a central primary and restoration paths are computed for a single new 
controller or a distributed computation is then used to 50 demand in the presence of established primary and restora- 
update the initial indices. Hon paths for existing demands, as well as situations in 

FC(1, X) Free capacity on link 1 for wavelength \ as known which many or all of the demands are involved in the path 

to the nodes i and j. When restoration capacity is not computations. In step 220 of FIG. 13A, the value of n is set 

shared, FC(1, X.) corresponds to C(l) times the number to zero, and the route pairs are ordered in accordance with 

optical fibers on hnk 1, minus the number of demands 55 the corresponding values of the above-described index G(r, 

using link 1 and wavelength X for primary or restoration X, s, //). The route pairs having the largest G values will be 

routes. When restoration capacity is shared by demands selected first. It should be noted that only one wavelength is 

with non-overlapping failure sets for at least some involved if the primary and restoration paths are required to 

demand pairs, the free capacity is increased to reflect have the same wavelength. 

this sharing. 60 In step 222, PROBE messages are sent along the selected 

FIGS. 13A-13B, 14A-14D and 15 illustrate an embodi- route pairs determined to have the highest G values in step 

ment of a distributed precomputation algorithm in which 220. Free capacity values FC(1,X) are obtained for each of 

restoration is provided by provisioning two span-node -link the selected route pairs. The G values are then updated in 

diverse paths for each demand, such that one of the paths can accordance with the route selections. In step 224, a deter- 

be used as a primary and the other path for restoration. In 65 mination is made as to whether any route pairs have FC 

other words, this embodiment of the algorithm provides values which are greater than zero on both the selected 

one-to-one restoration with no sharing of restoration capac- primary and restoration routes. If there is free capacity on all 
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links for both routes of the route pairs, step 226 indicates that of attempts is less than Ps^^, the REQUEST message is sent 

REQUEST messages are sent on the selected route pair again along the selected route pair, and an attempt counter 

which has free capacity on all links and the largest G values. is incremented. If the number of attempts equals or exceeds 

When a REQUEST message passes a given link on a A,^ the process returns to step 238 to determine if all route 

selected route, the given link checks if all of the preceding 5 pairs have been exhausted, and then proceeds in the manner 

links have accepted the REQUEST message. If the preced- described above. 

ing links have all accepted, the given link checks to see if it It should be noted that in certain situations the index G 
can accept the REQUEST message. If the link can accept the may not be available, while in other situations multiple route 
REQUEST message, it indicates acceptance by sending an pairs may have the same G value. The latter is equivalent to 
ACCEPT message back to the source node i. The given link lO not having any prior information on which to base a decision 
also passes the REQUEST message to the next link on the among the route pairs. In either case, the following modi- 
selected route. The given link may also start a link timer at fications to the process of FIG. 13 A may be implemented, 
this point. If the given link cannot accept the REQUEST Initially, k route pairs are selected out of the available route 
message, the link indicates rejection by sending a REJECT pairs having the same G value. PROBE messages are then 
message back to the source node, then inserts its current FC 15 sent on these route pairs. Each of the links on the routes will 
value in the REQUEST message and passes it to the next then send its current FC value, with an FC value equal to 
link in the route. Any link which sees that the REQUEST zero implying that the corresponding link has no available 
message was rejected by a preceding link simply inserts its capacity at the designated wavelength. The FC values, from 
current FC value and passes the message to the next link. the links are used to update the G values, and a REQUEST 
The destination node reflects the message, along with all 20 message is then sent on the route pair having the smallest G 
acceptance/rejection indicators and FC values, back to the value. This process is continued until the capacity request is 
source node. If a COMMIT message is not received by a accepted by all links on both routes of a selected route pair, 
given link before its corresponding link timer expires, any It should be noted that the relative congestion status of the 
link which previously accepted the REQUEST message link as given by the FC value is not updated in response to 
simply deallocates the capacity and updates its FC value 25 the PROBE message, but is instead updated in response to 
accordingly. the REQUEST message. As noted above, if a COMMIT 
Every link on each of the two routes of the selected route message is not received within a predetermined timeout 
pair therefore either accepts the capacity request or denies it. period after the acceptance of the REQUEST message, an 
In either case, the links will supply an updated FC value that accepting link assumes that the request it accepted has been 
is reflected back to the source node i. The source node i 30 rejected by another link on the primary or restoration route 
therefore waits for a return ACCEPT message. If there is not and therefore deallocates the capacity reservation and 
free capacity on all links of both routes of the selected route updates its FC value. If all k selected route pairs fail, then 
pair, step 228 determines whether all available route pairs another pair is selected and the above-described process is 
have been exhausted. If all route pairs ha ve not been repealed, until all route pairs are exhausted. As in the FIG. 
exhausted,the value of n is incremented in step 230, k„ pairs 35 13A illustration, if all route pairs are exhausted without 
with the largest G values among the unprobed pairs are getting commitment from all links on both routes in a 
selected, and the process returns to step 222 to send PROBE selected route pair, the process enters the contention reso- 
messages along the k„ selected pairs. If it is determined in lution phase of FIG. 13B. 

step 228 that all available route pairs have been exhausted, FIG. 13B illustrates the contention resolution process 
the process enters a contention resolution phase in step 232. 40 implemented at source node i for a simultaneous search for 
The contention resolution phase will be described in greater primary and restoration paths for a given demand. Since 
detail in conjunction with FIG. 13B below. entering this phase implies that all route pairs have been 
Step 234 involves a determination as to whether the exhausted, there is at least one link on one route of every 
ACCEPT message referred to in step 226 above is received route pair such that the FC value for that link and wave- 
before the expiration of a wait timer. If the ACCEPT 45 length is equal to zero. Step 250 selects the route pair having 
message is not received before expiration of the wait timer, the least number of links with an FC value of zero. In step 
and a REJECT message is received in step 236 before 252, CONTENTION RESOLUTION messages are sent 
expiration of the timer, step 238 determines whether all along the two routes of the route pair selected in step 250, 
available route pairs have been exhausted. If all pairs arc The CONTENTION RESOLUTION messages are directed 
exhausted, the process enters the contention resolution phase 50 to links on both routes which have FC values equal to zero, 
of FIG. 13B as shown at step 232. If all pairs have not been A timer is also started in step 252. Step 254 indicates that 
exhausted, the process returns to step 226 to send after expiration of this timer, the source node i which sent 
REQUEST messages on another one of the selected route the CONTENTION RESOLUTION messages in step 252 
pairs. If the ACCEPT message is received in step 234 before sends REQUEST messages on the primary and restoration 
the expiration of the wait timer, step 240 indicates that the 55 routes of the route pair selected in step 250. Step 256 
G values arc updated, and then the routing tables associated determines if the REQUEST messages are accepted. If the 
with source node i are updated to reflect the primary and REQUEST messages are accepted, step 258 indicates that 
restoration routes determined for the demand. The source the G values and routing tables are updated and the COM- 
node i may use the FC values and other information supplied MIT message is sent in a manner similar to that described in 
by the links on attempted routes to update the G values after 60 conjunction with step 240 of FIG. 13A. If the REQUEST 
every attempt. The source node i then sends a COMMIT messages are not accepted, step 260 checks to see if a 
message along the accepted primary and restoration routes contention resolution request counter is at its maximum 
such that all links on these routes can maintain updated FC value. If it is, the process ends as shown in step 262, with an 
values. If the REJECT message is not received in step 236 indication that the given demand for which the contention 
before expiration of the wait timer, a determination is made 65 resolution phase was called will remain unsatisfied. If the 
in step 242 as to whether the number of attempts equals or contention resolution request counter has not reached its 
exceeds a maximum number of attempts A^. If the number maximum value for the given demand, another route pair 
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with a minimum aumber of links having FC values of zero alternate route, and an alternate wavelength if the primary 
is selected in step 264, the contention resolution count is and restoration routes are permitted to have different wave- 
incremented, and the process continues in step 252 with the lengths. In order to determine which of several demands 
sendingof CONTENTION RESOLUTION messages on the have the most links with an FC value of zero in common 
newly selected route pair. 5 with the demand from nodes i toj, as set forth in step 310, 

FIGS. 14A through 14D show the process steps per- link 1 will need to know the complete routes of any demands 

formed at a link I which is in a primary or restoration path using it as a primary or restoration path. Performance 

probed by source node i. In step 270 of FIG. 14A, a PROBE analysis of actual networks using realistic data may be used 

message arrives at link 1 for wavelength X. The PROBE to indicate whether this additional information is worth 

message may have been generated in step 222 of FIG. 13 A. lO keeping at the links in a particular application. Once it is 

Link I responds in step 272 by appending its free capacity determined to keep this information at each link, it may be 

value FC(1, X) to the PROBE message and passing the possible to further enhance the contention resolution process 

message to the next link in the selected route. itself by providing, to the source node of the demand 

In step 280 of FIG. 14B, a REQUEST message for a selected in step 310 for restoration path rerouting, informa- 

particular selected route arrives at node 1 for wavelength X. is tion on the primary path of the demand requesting the 

The REQUEST message may have been generated in steps reroute. The source node, in selecting alternative restoration 

226 or 244 of FIG. 13A, or step 254 of FIG. 13B. A paths to try, may then avoid paths which include links in 

determination is made in step 282 as to whether the common with the demand requesting the reroute. 

REQUEST message has been accepted by all preceding FIG. 14D shows the process steps at link 1 in response to 

links in the route. If the REQUEST message was not 20 a successful rerouting of the demand selected in step 310 of 

accepted by all preceding links, step 284 indicates that the FIG. 14C. If the rerouting of the selected demand is 

link appends its FC value to the message, and passes the successful, the source node of the selected demand sends a 

message to the next link in the selected route. If the REROUTE SUCCESSFUL message along its earlier resto- 

REQUEST message was accepted by all preceding links, ration route. In step 316 of FIG. 14D, link 1 receives the 

step 286 determines whether the FC value associated with 25 REROUTE SUCCESSFUL message for wavelength X. The 

link 1 is greater than zero. If it is not greater than zero, link FC(1, X) value for link 1 is then incremented by one, and any 

1 appends an FC value of zero and a REJECT indication to routing table entries for the corresponding demand are 

the message, and passes it to the next link in the selected deleted, as shown in step 318. In other words, if link 1 had 

route, as shown in step 288. If the FC value is greater than an FC value which was at zero prior to the reroute, it updates 

zero, step 290 indicates that the FC value is decremented by 30 its FC value to reflect the successful reroute, 

one, appended along with an ACCEPT indication to the FIG. 15 shows the process steps of the algorithm at a node 

message, and passed to the next link in the route. A commit k which receives a REROUTE REQUEST generated in step 

timer is started in step 292. Step 294 determines whether or 312 of FIG. 14C. Node k thus represents the source node of 

not a COMMIT message is received from the source node i the demand selected for restoration path rerouting in step 

before the commit timer started in step 292 expires. If the 35 310 of FIG. 14C. In step 320 of FIG. 15, a REROUTE 

COMMIT message is not received before expiration of the REQUEST message arrives in source node k for a demand 

commit timer, the FC value is incremented by one as shown between nodes k and m using link 1 and wavelength X 

in step 296. If a COMMIT message is received after the FC for its restoration path. Along with the REROUTE 

value is incremented, it is denied by link 1. If the COMMIT REQUEST message comes the above-noted information 

message is received before expiration of the commit timer, 40 regarding the route of demand d^y which triggered the 

link 1 updates it routing table entries to reflect use of contention resolution and the subsequent request for the 

wavelength X by a particular demand d between nodes i and restoration path reroute. Step 322 finds n routes which have 

j. the n largest G(s, X) values of all available routes. From 

FIG. 14C shows the process steps at link 1 in response to among these n routes, the route having the minimum number 

receipt of a CONTENTION RESOLUTION message. In 45 of overlapping links with the route between nodes i andj 

step 300, a CONTENTION RESOLUTION message arrives which caused the reroute request. 

at link 1 for a wavelength X. This message may have been In Step 324, a REQUEST message is sent along the route 

generated in step 252 of FIG. 13B. Step 302 determines if selected in step 322. Step 326 determines whether or not the 

the free capacity value FC(I, X) for link 1 is greater than zero. REQUEST message was accepted by all of the links along 

If it is, the message is passed to the next link in the selected 50 the selected route. If it was accepted, the routing table entries 

route, as shown in step 304. If the FC value is not greater associated with node k are changed to reflect the new 

than zero, step 206 determines whether or not any demand restoration route for demand d^^, and a REROUTE SUC- 

is using link 1 and wavelength X for a restoration path. If no CESSFUL message is sent to link 1. If the REQUEST 

demand is using link 1 and wavelength X for a restoration message was not accepted in step 326, the value of an 

path, step 308 indicates that the required capacity cannot be 55 attempt counter is examined is step 330. If the attempt 

freed up, and the process ends. If there is at least one demand counter value is at its maximum value, the rerouting process 

using link 1 and wavelength X for a restoration path, step 310 ends for demand d^^^. If the attempt counter is not at its 

selects a demand which has a source node other than node maximum value, another route with a minimum number of 

i, which is using link 1 and wavelength X for a restoration overlapping links is selected in step 332, and the process 

path, and which has the most links with an FC value of zero 60 continues with the sending of a REQUEST message along 

in common with a demand between nodes i and j. If several the selected route in step 324. 

demandssatisfy the step 3 10 criteria, such ties can be broken As noted above, a second illustrative algorithm to be 

by selecting the demand which has the maximum number of described in this section determines primary and restoration 

overlapping links with the demand requesting a reroute. paths for all demands together in a distributed manner. Such 

In step 312, a REROUTE REQUEST message is sent to 65 an algorithm may be useful in a variety of situations. For 

the source node of the demand selected in step 310. The example, a central controller may have finished download- 

source node can then attempt to reroute the demand using an ing information obtained from centralized computations. 
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and the information may be in terms of actual routes or 
ordered tables of routes. Alternatively, there may be no 
update available from the central controller, but the previ- 
ously computed routes are to be recalculated in accordance 
with a periodic schedule. In either case, the existing routes, 
along with link contention status information, may be used 
to provide a starting point in terms of search order. The 
algorithm to be described below may be utilized in an 
application in which existing primary routes are to remain 
unchanged while only restoration routes are recomputed, as 
well as in an application in which both primary and resto- 
ration routes are to be recomputed. The algorithm will be 
described in detail for the second application, but it will be 
apparent to those skilled in the art that the algorithm can be 
easily modified to cover the first application. 

FIGS, 16 A and 16B are flow diagrams illustrating the 
second algorithm to be described in this section. In this 
algorithm, as noted previously, primary and restoration paths 
for all demands are computed together, without any sharing 
of restoration capacity among different demands. It will be 
assumed without limitation that initial values of G are 
available at each of the source nodes, and that the available 
G values are each based on a single route and wavelength. 
In other words, it is assumed that the above -described G(r, 
X) values are available. In applications in which these or 
similar values are not available, the reciprocal of the route 
length may be used to initiate suitable values for the index 
G. An altemative approach may be used when G(r, ky s, /i) 
values are available from centralized computation or prior 
usage. As noted previously, the G(r, X) values are generally 
best-suited for situations in which the same wavelength is 
required on both primary and restoration routes. The fol- 
lowing additional notation will be used in conjunction with 
FIGS. 16A and 16B: 

Uy Total number of demands between nodes i and j. 

n,y^ Number of curreotly-unrouted demands between 
nodes i and j. 

Knax Maximum number of routes to be tried at a 
given time. 

^max Maximum of G(r, X) over all (r, X). 

G* G^^-h, where h is a parameter 

Like the algorithms described in other sections herein, the 
algorithm illustrated in conjunction with FIGS. 16Aand 16B 
below may be separated into a first phase, referred to as 
Phase 1 and involving contention-free path determination, 
and a second phase, referred to as Phase 2 and involving 
contention resolution and optimization operations. The por- 
tion of the algorithm illustrated in FIGS. 16A and 16B 
corresponds generally to the Phase 1 portion of the algo- 
rithm. Step 340 of FIG. 16B determines a set {(r, X)} of 
routes for which G* ^ G(r, K)^ G^^. If the number of routes 
in this set is greater than k„^ then routes out of the set 
are selected. The variable k^^; is used to denote the number 
of routes selected. 

In step 342, PROBE messages are sent on all of the routes 
selected in step 340, without specifying any particular 
wavelengths. The free capacity values FC(1, X) are then 
obtained for all of the links and wavelengths on the selected 
routes. In step 344, the minimum value of FC(I, X) is 
determined for all links on the selected routes, and for each 
wavelength. The total available capacity VC(X) among the 
selected routes for wavelength X is then determined as the 
sum, over all of the selected routes r, of the minimum values 
of FC(l, X). F^^-„(X) is the minimum, over all of the routes r, 
of all of the minimum FC(1, X) values. In step 346, the values 
of the G(r, X) indices are updated for all of the selected 
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routes and for each wavelength X. The process then contin- 
ues with the flow diagram of FIG. 16B. 

Step 350 of FIG. 16B determines whether Kei^^mmW is 
greater than or equal to twice the above-described n,y^ value. 
5 If k^,,xFmin(X) is not greater than or equal to twice n,y^, and 
FC(X) is greater than zero, step 352 determines whether 
FC(X) is greater than or equal to twice n-^. If FC(X) is not 
greater than or equal to twice n,y^, step 354 indicates that 
n^y^-FC(X)/2 demands are removed and placed in an 
unrouted set. The remaining demands are assigned to fill 
primary and restoration routes. If FC(X) is greater than or 
equal to twice n^-^, step 356 indicates that a value a is 
defined as In-jJhCQS)^ and a times F„;„(X) demands are 
assigned to two routes with minimum capacity, using the 
route with the least capacity for restoration. This step 
^5 continues until all routes and demands are assigned tempo- 
rarily. After completion of either step 354 or 356, step 358 
indicates that REQUEST messages are sent on the primary 
and restoration routes. If k„,xF^j>,(X) is not greater than or 
equal to twice n,y^, and FC(X) is equal to zero, step 360 
20 indicates that Phase 2 of the algorithm then begins. As noted 
previously. Phase 2 of the algorithm corresponds generally 
to a contention resolution phase similar to that described in 
conjunction with FIG. 13B above. If k^^,,^^^,„(X) is greater 
than or equal to twice n,y^, step 362 indicates that the n,yc 
25 demands are divided into k,^; groups of equal size. A pair of 
disjoint routes is selected out of each of the k^^j groups, and 
is used for primary and restoration path capacity requests. 
The route with the largest minimum value of FC(1, X) over 
all of its links 1 is selected as the primary route. 
30 In step 364, route REQUEST messages are sent on the 
selected primary and restoration routes for each of the k^^^ 
groups of demands. In step 366, the G index values are 
updated for accepted demands, based on the corresponding 
new FC values. Appropriate routing table entries are made, 
35 and a COMMIT message is sent. The value of n^y^ is also 
updated. Step 368 indicates that the rejected demands are 
kept in the unrouted set, while the corresponding G, n,y^, FC 
and other values are updated. Step 370 determines whether 
there are any unrouted demands, or in other words, whether 
40 n^y^ is greater than zero. If n^y^ is greater than zero, the 
process returns to the beginning of FIG. 16B at step 350 
using the updated routes and updates G, FC and other values, 
as indicated in step 372. If n.y^ is not greater than zero, there 
are no unrouted demands, and the process ends. As noted 
45 above, the Phase 1 portion of the algorithm described in 
conjunction with FIGS. 16A and 16B can be executed 
asynchronously by all source nodes. Additionally, source 
nodes may act on multiple demands simultaneously, espe- 
cially in sending PROBE messages. 
50 At the end of Phase 1, there may be a number of unrouted 
demands. Phase 2 of the algorithm may be implemented 
such that changes are not permitted to the primary routes of 
demands which have been assigned primary and restoration 
routes. However, Phase 2 may permit changes in the resto- 
55 ration routes of previously-routed demands so as to allow 
more demands to be satisfied. It is assumed without limita- 
tion that all source nodes know the end of Phase 1. and 
beginning of Phase 2. This may be achieved through simple 
message exchanges or through suitably large timer from the 
60 beginning of Phase 1. The Phase 2 portion of the algorithm 
may be implemented in a manner similar to that described in 
conjunction with FIG. 13B above. The possibility that 
simultaneous search for routes by many demands may lead 
to deadlocks can be handled using one of the following two 
65 approaches. 

A first approach allows asynchronous searches to resolve 
contention and relies on the fact that there is sufiBcient 
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randomness in the start times of the various source nodes The Phase 1 portion of the algorithm illustrated in con- 
such that deadlocks are not created. Of course, the first junction with FIGS. 17 A and 17B is capable of handling 
demand to get a restoration route will lock the capacity out muhiple demands simultaneously, with sharing of restora- 
so others wiU not get it. Deadlock occurs only when two tion capacity. It will be assumed that all demands need 
demands are denied at two different links while there is 5 primary and restoration routes. It will also be assumed that 
capacity to satisfy any one of them by itself. Random time G values are given for route pairs, that is, the above-defined 
gaps may keep this probability small. Other factors allowing G(r, X, s, //) indices are used. Initial values of G(r, X, s, //) 
this approach to work are the fact that it is implemented in may be provided for all disjoint (route, wavelength) pairs 
the Phase 2 portion of the algorithm where most demands between nodes i and j, and for all i and j. The initial G values 
may already have been routed and only a small fraction of lo may be available from prior route usage or may be down- 
the total number of demands are requesting others to find loaded from a central controller. The algorithm could also be 
alternate routes to free LIP capacity on routes with conten- initiated with no specific initial knowledge of G values, 
tion. A second possible approach allows parallel searches Again, n,y will denote the number of demands between i and 
only among demands that do not interfere with one another. j, and n,y^ will denote the number of currently unrouted 
and creates a sequence of Phase 2 searches so that within a is demands between nodes i and j. G* will be used to denote 
search group, there is no possibility of a deadlock. the maximum G(r, X, s, //) value over all available (r, X, s, 
FIGS. 17A and 17B illustrate a Phase 1 portion of an ft). G^ is a parameter, and G is equal to G* minus G^. The 
algorithm in which sharing of restoration capacity between variable k„^ (n,y) will denote the maximum number of route 
demands is permitted. like the other algorithms described pairs to be tried at a given time. 

previously, the algorithm illustrated in conjunction with 20 As noted above, the portion of the algorithm illustrated in 
FIGS. 17A and 17B below may be separated into a first FIGS. 17 A and 17B corresponds generally to the Phase 1 
phase, referred to as Phase 1 and involving contention-free portion of the algorithm. Step 376 of FIG. 17B determines 
path determination, and a second phase, referred to as Phase a set {(r, X, s, fJ)} of routes for which G^ G (r, X, s, /i)^G*. 
2 and involving contention resolution and optimization If the number of route pairs in this set is greater than k„^, 
operations. '^The other algorithms described above generally 25 then k„^ route pairs out of the set are selected, based on 
do not allow two or more demands to share restoration number of links and random selection. In step 378, PROBE 
capacity even when no specific failure can cause these messages are sent on all of the route pairs selected in step 
demands to need restoration simultaneously. Here, the above 376. The free capacity values FC(1, X) are then obtained for 
algorithms are extended to allow restoration capacity shar- all of the links on the selected primary routes and the free 
ing while retaining the advantage of simplicity. The basic 30 capacity values FC(1, //) are obtained for all of the links on 
principle is as follows. The source node of a demand sends, the selected restoration routes. In step 380, the minimum 
at some time in the route computation process, an informa- value fc(r, X) of the FC(1, X) values for all links on each of 
tion message along the restoration route selected for that the selected primary routes is determined. The minimum 
demand. This message contains the primary route informa- value fc(s, jm) of the FC(1, ^ values for all links on each of 
tion for that demand. Every link on the restoration path 35 the selected restoration routes is also determined. The mini- 
retrieves and stores this information in association with the mum of {fc(r, X), fc(s, /w)} is designated fc(r, X, s, //), and 
specific demand. Given this information for all demands fc^^, is defined as the sum of the fc(r, X, s, ju) values over all 
using a specific link in their restoration route, the link can of the selected route pairs (r, X, s, //). The process continues 
easily determine the minimum capacity needed to support with the flow diagram of FIG. 17B. 
the restoration needs of these demands, while allowing for 40 Step 380 of FIG. 17B determines whether fc^^, is greater 
capacity sharing. That is, the links determine which than or equal to the above-described n^y^ value. If fc^^^, is not 
demands using the specific link for a restoration route have greater than or equal to n^y^, and fc^^^ is greater than zero, 
primary paths with disjoint failure sets, and allow capacity step 384 selects fc^^, demands out of the n,y^ demands, and 
sharing in their calculation of required capacity. In many temporarily assigns route pairs (r, X, s, fi) to each of the 
cases, the required capacity will be smaller than the sum of 45 selected demands such that fc(r, X, s, fi) demands are 
demands using that link for a restoration route. TTais differ- assigned to route pairs (r, X, s, //). REQUEST messages are 
ence represents the advantage provided by sharing, and can then sent in step 386 on the route pairs assigned to the 
be used to support more primary and/or restoration routes. demands selected in step 384. Upon acceptance or rejection 
In order to keep the restoration capacity sharing procedure of the REQUEST messages, the FC and G values are 
simple, the information message sent on the primary path 50 updated. If fc^^^ is equal to zero, step 388 indicates that Phase 
can be sent after the restoration route is selected. A disad- 2 of the algorithm then begins. As noted previously. Phase 
vantage of such an approach is that the capacity sharing 2 of the algorithm corresponds generally to a contention 
cannot be exploited to allow the use of the hnk if the link resolution phase similar to that described in conjunction 
would otherwise have to deny the request. However, the with FIG. 13B above. If fc^^; is greater than or equal to n,y^, 
advantage is that the capacity sharing calculations may be 55 step 390 indicates that a value a is defined as In^jjic^^i, and 
performed "off-line." Again, performance analysis of actual a times fc(r, X, s, /i) demands are assigned temporarily to 
network configurations may be used to determine whether it route pairs (r, X, s, fi). REQUEST messages are then sent in 
will be more advantageous in a given application to send the step 392 on the route pairs assigned to the demands selected 
information message with every attempt at getting a resto- in step 390. Upon acceptance or rejection of the REQUEST 
ration route, rather than after getting a restoration route from 60 messages, the FC and G values are updated, 
available capacities without taking into capacity sharing by The set of steps designated 394 is repeated for each of the 
a new demand into account. It should be noted that the demands d selected in step 384. In step 396, a determination 
algorithm of FIGS. 16A and 16B may also be implemented is made as to whether a particular demand d was accepted, 
with the above-noted primary path information message sent If the demand d was not accepted, it is put back into the set 
to links in the restoration path. In such an embodiment, the 65 of n^y^. unrouted demands, as shown in step 398. If the 
information message would generally be sent after comple- demand d was accepted, its temporary route pair assignment 
tion of Phase 1 but before the start of Phase 2. in step 384 is made permanent by an appropriate routing 
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table entry. COMMIT messages are then sent along the rithms in terms of minimizing the capacity requirements, 

designated route pair for demand d. After the set of steps 394 Moreover, the efficacy of the distributed algorithms 

is completed, step 402 determines whether or not all route described herein generally depends on the order of execution 

pairs have been exhausted. If all route pairs have not been by different network elements, and the manner in which 

exhausted, step 404 indicates that the routed demands and 5 deadlocks are broken at various steps. Additional informa- 

their corresponding route pairs are removed from tion can therefore be collected in a distributed fashion to 

consideration, and the process returns to step 376 of FIG. generate correlation information. Also, additional informa- 

17A to process another set of unrouted demands. If all route tion exchanges between algorithm steps can be used to order 

pairs have been exhausted, the Phase 2 contention resolution the steps and to break deadlocks more effectively. However, 

portion of the algorithm is started, as indicated by step 408. lO distributed implementation of these exchanges can result in 

As previously noted, the Phase 2 portion of this algorithm large communication overhead and delay in reaching con- 

may proceed in a manner similar to that described in vergence in algorithms. 

conjunction with FIG. 13B above. These problems are addressed by an illustrative hybrid 

Similarly, the set of steps designated 410 Ls repeated for centralized/distributed embodiment of the invention which 

each of the demands d assigned a temporary route pair in is mixes centralized information collection and computation, 

step 390. In step 412, a determination is made as to whether distributed information exchange and precomputation, and 

a particular demand d was accepted. If the demand d was not real time execution of precomputed routes. The centralized 

accepted, it is put back into the set of n,y^ unrouted demands, information collection and computation is less frequent than 

as shown in step 414. If the demand d was accepted, its the distributed information exchange and precomputation, 

temporary route pair assignment in step 390 is made per- 20 but is selected carefully to provide information that is less 

manent by an appropriate routing table entry, and COMMIT likdy to change frequently and that is critical for the efficacy 

messages are sent along the designated route pair for of the algorithms implemented in a distributed fashion. The 

demand d as shown in step 416. After all of the unaccepted type of information generated and downloaded from a 

demand d have been placed back in the unrouted set in step central controller to the controllers in network elements may 

414, the value of n^y^ is updated, and a set of route pairs is 25 include: 

selected to provide new PC values, as shown in step 418. (j) Connectivity information, such as who is connected to 

Step 420 indicates that the process then returns to step 382 whom in absence of failure 

of FIG. 17B to attempt to find route pairs for remaining ^^^^j ^ capacities. 

unrouted demands. \i . . ^ „ , 

(3) A subset of all possible routes between a source and a 

6. Hybrid Centralized/Distributed Precomputation destination in the network with an order suggesting the 

Embodiments of the invention which utilize a hybrid of search order. The search order can be modified during route 

centralized and distributed techniques will now be precomputation. The order may be specified individually for 

described. Generally, primary and restoration route selection ^^^^ wavelength for optical networks which do not permit 

using global information and centralized algorithms result in 35 wavelength interchange. The subset may be selected by 

smaller capacity requirements compared to algorithms rely- eliminating obviously bad routes from the results of cen- 

ing only on distributed computation using only local infor- tralized computation. 

mation. However, the centralized approaches may lead to (4) A link contention index for every link. This index 

scalability problems for very large networks. Moreover, reflects the number of routes out of the above set that 

centralized approaches typically create a need for frequent 40 contains this link. Instead of a straight count, a weighted 

state updates to the central controller. Besides becoming a count may be given with weights depending on the order of 

computation bottleneck, the central controller may also the particular route in the set of routes for a source - 

become a communication and reliability bottleneck. In some destination pair. 

situations, the difference in capacity requirements between (5) Indices given to nodes, links, and/or routes so the 

centralized and distributed algorithms will not be large. For 45 indices can be used in deciding the order in which distrib- 

other situations, particularly those involving highly con- uted algorithmic actions involving nodes, links, and/or 

strained networks, the capacity difference could be exces- routes should be executed to maximize the efficacy of the 

sive, algorithms. 

In order to further improve the capacity requirement It should be noted that only a subset of information from 

metrics, or alternatively the restorabihty fraction for a given 50 each of the above items need be downloaded from the 

capacity, of the distributed algorithms described above, local central controller to a given node, that subset being the one 

information can be supplemented by collection of additional relevant to that particular node. The details as to how the 

information through distributed communication using 'state information downloaded from a central controller is used 

exchange' protocols. Many of the algorithms described during distributed precomputation of routes will depend on 

herein can be configured to use such state exchanges peri- 55 the specific steps of the individual distributed algorithms. In 

odically and follow them with distributed precomputation of many optical networks, the restoration routes may be 

restoration routes. With this procedure, both state informa- allowed to change but primary routes are typically more 

tion exchange and route precomputation are distributed. The constrained in terms of the frequency with which changes 

state exchanges in limited form allow the algorithms to use can be made. With the large variation in demands due to 

some global information in order to make them more 60 provisioning and reprovisioning in the corresponding ser- 

efiBcient from a capacity perspective. The state information vice network, provisioning and reprovisioning of primary 

exchange may be limited to that needed for the specific and restoration routes in the optical network will generally 

algorithm being used, in order to prevent the scalability and occur randomly, such as from many times a day to once 

bottleneck problems typically associated with fully central- every few days. Since restoration routes are usually allowed 

ized approaches. In many of the algorithms described herein, 65 to change more frequently, distributed algorithms may be 

correlation among state information for different parts of the invoked periodically or when demand changes occur in 

network can add significantly to the efficiency of the algo- order to optimize primary and restoration routes for new 



02/11/2004, EAST version: 1.4.1 



us 6,205,117 Bl 



35 



36 



demands and restoration routes for existing demands. 
However, given the incremental nature of the changes, many 
of the existing routes will still be the most appropriate ones. 
These existing routes can therefore be used lo drive the 
initial ordering of routes in distributed algorithms. Some of 5 
these existing routes may fail in the route testing portions of 
the algorithms because of the changes in demands. It is at 
this stage that the description given above for each algorithm 
can take over and achieve the best selection in a distributed 
fashion. lO 

Less frequently, the entire network may need to be 
examined in order to recalculate primary and restoration 
routes for all demands. Such a process may be implemented 
using completely centralized compulation, since it will be 
relatively infrequent and therefore many of the concerns 
with centralized computation noted above are of less 
concern, or through distributed computation. If done 
centrally, the selected routes can be downloaded and then 
given to the distributed algorithms until the next complete 
network overhaul. Alternatively, the central computation 
may be used only to provide ordering of routes, with the 
ordering given to a distributed algorithm as a starting point 
for route computation. In any case, the distributed algo- 
rithms then take over using the given information as a 
starting point. The foregoing statements regarding use of 25 
limited types of centralized computation in conjunction with 
distributed precomputation are applicable to all of the illus- 
trative distributed precomputation algorithms described 
herein. 

The above-described embodiments of the invention are 
intended to be illustrative only. Numerous alternative 
embodiments may be devised by those skilled in the art 
without departing from the scope of the following claims. 

What is claimed is: 

1. A method of determining signal paths for capacity 
demands in a network which includes a plurality of nodes 
and a plurality of links, each of the links interconnecting a 
pair of nodes, the method comprising the steps of: 

determining at least one signal path in the network using 
a distributed precomputation process implemented by 
at least a subset of the plurality of nodes, wherein the 
distributed precomputation process includes a first 
phase in which paths are allocated for capacity 
demands to the extent possible without violating capac- 
ity constraints, and a second phase in which conten- 
tions between demands for the same capacity are 
resolved; and 

providing link-based capacity control as part of the dis- 
tributed precomputation process to identify demands 50 
contending for capacity on a specific link and to resolve 
contention on the specific link, wherein the link-based 
capacity control involves storing a link status table at 
one or more nodes controlling a given one of the links. 



the link status table listing a number of specific failures 
and demands which are affected by the failures. 

2. The method of claim 1 wherein the link status table 
includes, for one or more of the specific failures, an indi- 
cation of spare capacity on a corresponding link. 

3. The method of claim 1 wherein the link status table 
further includes a link status indicator. 

4. The method of claim 3 wherein a value of the link status 
indicator indicates that spare capacity is available for a 
specific failure and that the given link is ready to allocate 
and reserve capacity for a requesting demand. 

5. The method of claim 3 wherein a value of the link status 
indicator indicates that the link has no spare capacity and 
also that it will not entertain any requests from demands to 
enter a link-based contention resolution slate. 

6. An apparatus for use in determining signal paths for 
capacity demands in a network which includes a plurality of 
nodes and a plurality of links, each of the links intercon- 
necting a pair of nodes, the apparatus comprising: 

a nodal processor associated with a corresponding one of 
the plurality of nodes in the network, the nodal pro- 
cessor implementing at least a portion of a distributed 
precomputation process for determining at least one 
signal path in the network, wherein the distributed 
precomputation process includes a first phase in which 
paths are allocated for capacity demands to the extent 
possible without violating capacity constraints, and a 
second phase in which contentions between demands 
for the same capacity are resolved, wherein the nodal 
processor further provides link-based capacity control 
as part of the distributed precomputation process to 
identify demands contending for capacity on a specific 
link and to resolve contention on the specific link, 
wherein the link-based capacity control involves stor- 
ing a link status table at one or more nodes controlling 
a given one of the links, the link status table listing a 
number of specific failures and demands which are 
affected by the failures. 

7. 'llie apparatus of claim 6 wherein the link status table 
includes, for one or more of the specific failures, an indi- 
cation of spare capacity on a corresponding link. 

8. The apparatus of claim 6 wherein the link status table 
further includes a link status indicator. 

9. The apparatus of claim 8 wherein a value of the link 
status indicator indicates that spare capacity is available for 
a specific failure and that the given link is ready to allocate 
and reserve capacity for a requesting demand. 

10. The apparatus of claim 8 wherein a value of the link 
status indicator indicates that the link has no spare capacity 
and also that it will not entertain any requests from demands 
lo enter a link-based contention resolution slate. 



02/11/2004, EAST version: 1.4.1 



